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INHIBITORS OF HIV MEMBRANE FUSION 

RELATED APPLICATIONS 

This application is related to U.S. Provisional Application 60/043,280, 
entitled Core Structure of gp41 from the HIV Envelope Glycoprotein, by David C. 
Chan, Deborah Fass, Min Lu, James M. Berger and Peter S. Kim, filed April 17, 
5 1997 and U.S. Application 09/062,241, entitled Core Structure of gp41 from the 
HIV Envelope Glycoprotein, by David C. Chan, Deborah Fass, Min Lu, James M. 
Berger and Peter S. Kim, filed April 17, 1998. The present application claims the 
benefit of U.S. Provisional Application 60/094,676, entitled Inhibitors of HIV 
Membrane Fusion by David C. Chan, Debra M. Ehrgott and Peter S. Kim, filed July 

10 30, 1998; U.S. Provisional Application 60/100,265, entitled Inhibitors of HIV 
Membrane Fusion, by David C. Chan, Debra M. Ehrgott and Peter S. Kim, filed 
September 14, 1998 and U.S. Provisional Application 60/101,058, entitled Inhibitors 
of HIV Membrane Fusion, by David C. Chan, Debra M. Ehrgott and Peter S. Kim, 
filed September 18, 1998; and U.S. Provisional Application 60/132,295, entitled 

15 Inhibitors of HIV Membrane Fusion, by Debra M. Ehrgott, David C. Chan, Vladimir 
Malashkevich and Peter S. Kim, filed May 3, 1999. The entire teachings of these 
referenced applications are incorporated herein by reference. 

GOVERNMENT SUPPORT 

The invention was supported, in whole or in part, by National Institutes of 
20 Health Grant Number P01 GM56552. The United States Government has certain 
rights in the invention. 

BACKGROUND OF THE INVENTION 

Structural studies of proteins from human immunodeficiency virus type 1 
(HIV-1) have been essential in the development of anti-retroviral drugs. .Structure- 
25 based drug development has been most intense for reverse transcriptase inhibitors and 
protease inhibitors, the two classes of HIV-1 drugs in clinical use. It would also be 
useful to be able to carry out structure-based drug development against HIV entry. 
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SUMMARY OF THE INVENTION 

As described herein, the cavities on the surface of the N-helix coiled-coil of 
HIV envelope protein gp41 subunit (e.g., HIV-1 envelope protein gp41-subunit) are 
targets for drugs or other agents which, by binding the coiled-cbil surface, 
5 particularly the cavities, inhibit HIV entry into cells. This is useful as the basis for 
identifying and designing drugs or agents which inhibit entry of HIV (e.g., HIV-1, 
HIV-2) into cells. 

Results described herein show that the coiled-coil cavity (also referred to as 
the hydrophobic pocket) in the gp41 core is an attractive drug target and that 

10 molecules which bind the cavity interfere with (inhibit) HIV infectivity (HIV entry 
into cells). Applicants have shown, for the first time, that conserved residues 
projecting into the hydrophobic pocket clearly play a major role in the ability of C34 
to inhibit HIV-1 infection. The importance of cavity contacts (between the N-helix 
coiled-coil cavity and residues of the C peptide region of gp41) to gp41 function is 

15 clear. Conversely, the importance of preventing such cavity contacts in inhibiting 
gp41 function and, thus, inhibiting HIV-1 entry into cells, is also clear. In addition, 
directing drugs against the hydrophobic pocket of the central-coiled coil of gp41 
targets one of the most highly conserved regions of the HIV-1 envelope proteins, 
which means that drugs which target the coiled-coil surface, and particularly its 

20 hydrophobic pocket, will have broad activity against diverse HIV isolates and that it 
will be difficult for drug-escape mutants to emerge. 

A variety of methods, such as mirror-image phage display techniques (T. N. 
Schumacher, et ai, Science, 277:1854 (1996)), combinatorial chemistry (A. 
Borchardt, S. D. Liberies, S. R. Biggar, G.R. Crabtree, S.L. Schreiber, Chem. Biol, 

25 4:961 (1997); J.C. Chabala, Curr. Opin. BiotechnoL, 6:632 (1995)), rational drug 
design and other drug screening and medicinal chemistry methods can be used to 
identify D-peptides, peptidomimetics and small molecules that bind the coiled-coil 
cavity with sufficient affinity to inhibit HIV-1 infection. The close correlation 
between N36/C34 stability and C34 potency, described herein, suggests that the 

30 effectiveness of such compounds will depend critically on the strength of their 
cavity-contacts. As described herein, candidate compounds can be tested for their 



WO 00/06599 



PCT/US99/17351 



ability to interfere with formation of a stable complex between C34 and N36 or their 
ability to disrupt binding of the two (disrupt the complex), thereby providing rapid, 
quantitative screens to identify and evaluate potential inhibitors of HIV- 1 entry. 
Alternatively, screening can be carried out to identify molecules or 
5 compounds which interfere with or disrupt binding of the N-helix coiled-coil cavity 
and a peptide which binds the cavity, thus providing methods of identifying 
molecules which are "pocket specific" binding agents or drugs. Molecules and 
compounds described herein (also referred to as drugs or agents) are useful to 
inactivate gp41 and, thus, prevent or reduce (inhibit) HIV-1 entry into cells. 

10 Without wishing to be bound by theory, it is reasonable to propose that these 

inhibitors bind to the pre-hairpin intermediate of gp41 and prevent its conversion to 
the trimeric hairpin structure of the gp41 core which corresponds to the fusion-active 
state of gp41. (Chan, D.C. and P.S. Kim, Cell 93:681 (1998), See Figure 1). Thus, 
the present methods are useful to identify drugs or agents which inhibit (totally or 

15 partially) formation of the fusion-active state of HIV-1 gp41 envelope protein. In 
the method, the ability of a candidate inhibitor (also referred to as a candidate drug), 
which can be any type of compound or molecule, such as a small molecule (e.g., a 
small organic molecule), a peptide (a D-peptide or an L-peptide), a peptidomimetic, 
a protein or an antibody, to bind the N-helix coiled-coil of gp41 and form a stable 

20 complex is assessed. Compounds or molecules which bind to the N-helix coiled- 
coil are further assessed for their ability to inhibit gp41 function (inhibit membrane 
fusion), such as through HIV-1 infection (viral entry) and syncytium assays, 
representative models of which are described and referenced herein. Those agents 
shown to inhibit gp41 function through such assays can be further assessed for their 

25 activity in additional in vitro assays and in appropriate animal models (e.g., Letvin, 
N.L., Science, 280, (5371): 1875 - 1880 (1998), Hirsch, V.M. and P.R. Johnson, 
Virus Research, 32 (2): 183-203 (1994); Reimann, K.A. et ai, J. VivoL, 70 (10): 
6922-6928 (1996)). Any suitable approach can be used to assess binding of 
candidate inhibitors to the N-helix coiled-coil and, as a result of the work described 

30 herein, to the N-helix coiled coil cavity. In one embodiment, the ability of a 

candidate inhibitor to bind the synthetic peptide N36 (described in Lu, M. et ai, 1 
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Biomol Struct Dyn. 15: 465 (1997), Chan, D.C. et al, Cell, 89, 263 (1997) and U.S. 
Provisional Application 60/043,280, entitled Core Structure of gp41 From the HIV 
Envelope Glycoprotein, by David C. Chan, Deborah Fass, Min Lu, James M. Berger 
and Peter S. Kim, filed April 17, 1997) is assessed. The stability of the resulting 
5 complexes is assessed using methods described herein. 

In a particular embodiment of the method of identifying compounds or 
molecules (drugs or agents) which bind the N-helix coiled-coil cavity, a soluble 
model that presents the gp41 coiled-coil cavity is used. The six helix bundle of HIV 
gp41 consists of an internal trimeric coiled-coil, composed of three identical N- 
10 peptides, surrounded by three C-peptides which fit into a conserved hydrophobic 
groove on the outside of the trimeric coiled-coil. The C-terminal end of the trimeric 
coiled-coil contains a large cavity into which bulky hydrophobic groups from the C- 
peptide pack. This hydrophobic pocket is used as the target for anti-HIV drug 
discovery and/or design. Unfortunately, in the absence of the C-peptide, the N- 
15 peptide is aggregated and not 100% helical. Thus, simply using an N peptide from 
HIV-1 gp41, such as N36, N51 (Lu, M. et al, Nature Struct Biology, 1995) or DP- 
107 (Wild et al., PNAS 89: 10537- 10541 (1992) is unlikely to provide an effective 
model for the N-helix coiled-coil. 

As described herein, Applicants have succeeded in producing a soluble, non- 
20 aggregating trimeric peptide model of the hydrophobic pocket of HIV gp41 and, 
thus, for the first time, have provided a model that properly presents this 
hydrophobic pocket or cavity (in a manner or configuration which forms a similar 
structure to the corresponding residues in the HIV gp41 structure). (The terms 
"pocket" and "cavity" are used interchangeably.) As described, a peptide (also 
25 referred to as a fusion protein) which includes a soluble, trimeric coiled coil portion 
and a portion from the N-peptide region of HIV gp41 that includes the amino acid 
residues which form the pocket or cavity of the N-helix coiled-coil of HIVgp41 (the 
pocket-comprising residues of the N-peptide) has been produced and shown to be 
such a soluble model, useful to identify molecules or compounds which inhibit HIV 
30 gp41 function and, thus, HIV entry into cells. The trimeric version of the coiled-coil 
in the peptide (also referred to as a fusion protein) can be the coiled-coil region of a 
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protein which is not a protein of HIV (a non HIV protein, such as GCN4-pI Q I) or a 
protein of HIV origin (a protein derived from HIV or having the same or a similar 
amino acid sequence as an HIV protein). In a specific embodiment, the soluble, 
non-aggregating trimeric peptide model of the large cavity, referred to as IQN17, 
5 comprises a trimeric version of the coiled-coil region of GCN4, the yeast 

transcription activator, and a portion of the C-terminal end of the N peptide of gp41 . 
IQN17 contains 29 residues of GCN4-pI Q I (formerly referred to as GCN4-pIQ in 
U.S. Provisional Application 60/101,058) (Eckert, D.M. et al. J, Mol Biol., 
254:859-865 (1998)), including three mutations for increased solubility, and 17 

10 residues of HIV; there is a one residue overlap between the two proteins, making the 
total length of the fusion protein 45 residues. The sequence of GCN4-pI Q I is: ac- 
RMKQIEDKIEEI LSKQYHIENEIAR IKKLIGER (SEQ ID NO: 1). The HIV 
Sequence is: LLQLTVWG IKQLQARIL (SEQ ID NO:20). The sequence of IQN17 
is: ac-RMKQIEDKIEEIESKQKKIENEIARIKK I J.QITVWOIKQLQARIL -am 

15 (SEQ ID No:2). The HIV portion is underlined in SEQ ID No: 2; ac- represents an 
N-terminal acetyl group and -am represents a C-terminal amide. The sequence of 
the soluble, trimeric version of the coiled-coil region of GCN4 (referred to as a 
soluble, trimeric coiled coil of GCN4) in IQN17 is: 

RMKQIEDKIEEIESKQKKIENEIARIKK (SEQ ID No: 25). The superhelix 

20 parameters such as rise and pitch (Harbury, P.B. et ai, Nature 377:80-83 (1994); 

Harbury et al. y PNAS 92:8408-8412 (1995)) of the GCN4-pIqI coiled coil are nearly 

identical to the HIV gp41 N-helix coiled coil. Therefore, the resulting fusion protein 

molecule (IQN17) is predicted to form a long trimeric coiled coil, which presents the 

N-peptide hydrophobic cavity at the C terminus. IQN17 is fully helical, as 

25 determined by circular dichroism, with a molar ellipicity at 222nm of -36,000 deg 
2 

cm dmol- 1 . As determined by sedimentation equilibrium, IQN1 7 is close to a 
discrete trimeric species with a ratio of observed molecular weight to calculated 
molecular weight ranging from 3.00 to 3.16 times the monomer molecular weight at 
a concentration of 20 /^M. As determined by X-ray crystallography, IQN17 presents 
30 the N-peptide hydrophobic pocket in a manner that is nearly identical to the pocket 
in the HIV gp41 N-helix coiled coil. 
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The IQN17 molecule (in the natural L-handedness or enantiomeric D- 
handedness) can be used in screens, including high-throughput drug screens, to 
identify molecules that bind to the coiled-coil pocket. The IQN17 molecule, in the 
D-handedness, has been used as a target in mirror image phage display (Schumacher 
5 et aL Science, 271: 1854 1 1996) to identify small molecules (D-peptides) which 
bind to the hydrophobic pocket of gp41 (in the natural L-handedness) and inhibit 
HlV-membrane fusion. The desired target (the N-helix of HIV gp41 which includes 
the hydrophobic pocket) is the exact mirror image of the naturally-occurring target. 
It is used to screen a library or collection of compounds or molecules which are to 

1 0 be assessed for their ability to bind the mirror image of the naturally-ocurring coiled- 
coil pocket. The mirror image of a compound or molecule found to bind the mirror 
image of the naturally-occurring gp41 pocket, will bind the gp41 pocket in the 
natural handedness. The library or collection screened can be of any type, such as a 
phage display library, peptide library, DNA library, RNA library, combinatorial 

1 5 library, collection of chemical agents or drugs, cell lysate, cell culture medium or 
supernatant containing products produced by cells. In the case of a phage display 
library, the D-target is used to screen phage coat proteins. Specific phage clones that 
bind to the target are identified and the mirror images of the expressed proteins are 
chemically synthesized with D-amino acids. By using IQN17 in mirror-image 

20 phage display, D-peptides that bind to the gp41 hydrophobic pocket have been 

identified. Further assessment has been carried out, as described, to demonstrate the 
ability of D-peptides to inhibit HIV gp41 function. D-peptides which bind the gp41 
hydrophobic pocket and inhibit HIV infectivity have been identified. D-peptides 
which bind the hydrophobic pocket also will serve as lead molecules for drug 

25 development and/or reagents for drug discovery (where the drugs bind to the coiled- 
coil pocket and inhibit HIV infectivity). The IQN17 molecule, in the natural L- 
handedness, can be used in screens, including high-throughput screens, to identify 
molecules that bind to the coiled-coil pocket. IQN17 can be used to screen a 
collection or library of compounds or molecules which are to be assessed for their 

30 ability to bind the hydrophobic pocket. The library or collection screened can be of 
any type, such as a phage display library, RNA library, DNA library, peptide library, 
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combinatorial library, collection of chemical agents or drugs, cell lysate, cell culture 
medium or supernatant containing products produced by cells. Compounds or 
molecules which bind the hydrophobic pocket also will serve as lead molecules for 
drug development and/or reagents for drug discovery. 
5 Fusion proteins which are variants of IQN1 7 can be produced and used to 

screen for drugs which bind the gp41 N-helix coiled-coil pocket. Any of a wide 
variety of variations can be made in the GCN4-pIqI component of IQN17 and used 
in the method, provided that these changes do not alter the trimeric state of the 
coiled-coil. For example, the amino acid composition of the GCN4 component can 

1 0 be changed by the addition, substitution, modification and/or deletion of one or more 
amino acid residues, provided that the trimeric state of the coiled-coil is maintained. 
For example, the Asp residue in IQN17 (at a "f-position" of the coiled coil) can be 
replaced by any of the naturally-occurring amino acids. (O'Neil and DeGrado, 
Science 250:646 (1990)). Alternatively, this component of the fusion protein can be 

1 5 a trimeric version of the coiled-coil region of another protein, such as that from 
Moloney Murine Leukemia Virus (Fass, D. et al. Nature Struct. Biology, 3:465 
(1996)), GCN4-pII (Harbury et al, Nature, 5/7:80, 1994) or the ABC heterotrimer 
(Nautiyal and Alber, Protein Science 5:84 (1999)). 

Changes can also be made in the amino acid composition of the fusion 

20 protein component which is the C-terminal portion of the HIV gp41 N peptide to 
produce IQN1 7 variants. The C-terminal portion can be changed by the addition, 
substitution, modification and/or deletion of one or more amino acid residues. The 
amino acid composition of either or both components of the fusion protein can be 
altered, and there is no limit to the number or types of amino acid residue changes 

25 possible, provided that the trimeric state of the coiled-coil and the hydrophobic 

pocket of the N peptide of HIV gp41 are maintained. IQN17, IQN17 variants or any 
soluble model of the large cavity can be used to screen for drugs which bind the N- 
helix coiled-coil, especially the pocket, or for lead drug candidates or candidates for 
use in vaccine preparations, to be further screened using methods known to those of 

30 skill in the art, such as in a high throughput format. 

Results described herein are useful to screen for inhibitors of HIV gp41 
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which are variants of C34 as described below. Once a variant of C34, such as a C34 
variant which stably binds N36, has been identified, it can be used and further 
assessed as obtained or it can be modified (e.g., by altering, adding, deleting or 
substituting at least one amino acid residue or adding a non-amino acid substituent), 
5 if desired or needed (e.g., to enhance stability, solubility, bioavailability). 

Alternatively, a C34 variant can be assessed to determine if a shorter component 
(region of fewer amino acid residues) also is active as an inhibitor. As discussed 
herein, the three C34 residues Trp 628 , Tip 631 and He 635 that pack into the deep, 
conserved pocket in the N36 trimer are critical for inhibitory activity. The 

1 0 observation that C34 variants that have a higher affinity for the N36 coiled-coil have 
more potent inhibitory activity against HIV infection forms the basis for screens to 
identify and evaluate potential inhibitors. For example, using the "split-synthesis" 
technique (Chen, C.L., et al Methods Enzymoi 267:211-219 (1996); Lam, K.S. et 
al. y Nature, 354: 82-84, (1991)) of combinatorial peptide chemistry, a library of C34 

1 5 variants is synthesized in which the three critical hydrophobic residues are randomly 
replaced by chemical substitutions of varying hydrophobic character. This synthesis 
technique results in the generation of a vast library of beads, each containing many 
copies of a single variant C34 peptide (i.e., a "one-bead, one-compound" type of 
library). To identify C34 variants which stably bind the N-helix coiled-coil, a 

20 labeled version of N36 (or a modified N-peptide) is mixed with the peptide beads 
under conditions (e.g., elevated temperature) that restrict binding to only those C34 
variants with the highest affinity. Binding is measured by detection of the label on 
the N-helix peptide, using known methods. Simple modifications of the 
split-synthesis technique allow ready identification of the selected peptide sequence 

25 by mass spectroscopy (Youngquist, R.S. et al.,J. Amer. Chem. Soc. 117, 3900-3906 
(1995)). The C34 variants selected, particularly those with the highest binding 
affinities for N36, are tested in syncytium and infection assays for gp41 inhibitory 
activity. Truncated versions of these C34 variants, containing only the 
cavity-binding region, can also be tested for inhibitory activity. Alternatively, a 

30 library of other peptides to be assessed can be synthesized to generate a library of 
beads, each containing (having bound thereto) a peptide to be assessed. This library 
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is analyzed as described above for the C34 variants and resulting hits (members with 
appropriate binding affinities for N36) are further analyzed for gp41 inhibitory 
activity. As a second example, the N36 peptide or the soluble variants described 
earlier, such as IQN17, GCN4-N-helix peptide can be used as a target for phage 
5 display or mirror-image phage display techniques to identify peptides that bind to 
the cavity. 

IQN17 can also be used to raise antibodies (monoclonal and/or polyclonal) 
that bind to the coiled-coil cavity. IQN1 7 can further be used, either alone or in 
combination with other materials, in a vaccine, which will elicit the production of 

10 antibodies that bind to the coiled-coil in the individual to whom it is administered 
(the vaccinee), and thereby offer protection against infection and/or disease. 

Peptides, both D-peptides and L-peptides, which fit into a deep hydrophobic 
pocket in the trimeric N-helix coiled-coil of HIV- 1 envelope glycoprotein gp41 are 
also the subject of this invention. The D-peptides are the first molecules that have 

1 5 been shown to bind exclusively to the gp41 hydrophobic pocket. The observation 
that these D-peptides inhibit gp41 -mediated membrane fusion processes (syncytia 
formation and viral infection) provides the first direct demonstration that HIV-1 
infection can be inhibited by molecules that bind specifically to pocket. The 
validation of the gp41 hydrophobic pocket as a drug target sets the stage for the 

20 development of a new class of orally bioavailable anti-HIV drugs, that work by 

inhibiting viral entry into cells. Such drugs would be a useful addition to the current 
regimen used to treat HIV-1 infection with combination therapies. D-peptides, such , 
as the D-peptides described herein, portions, modification and variants thereof and 
larger molecules (e.g., polypeptides) which comprise all or a portion of a D-peptide 

25 described herein, are useful to inhibit HIV membrane fusion and, thus, HIV entry 
into cells. D-peptides, corresponding to the D-amino acid version of phage 
sequences identified as described herein, are inhibitors of HIV-1 infection and 
syncytia formation. The C-terminal residues in these D-peptide inhibitors have the 
sequence pattern: CXXXXXEWXWLCAA-am. (In the phage-display library, the 

30 positions corresponding to the C residues were encoded as either C or S, the 

positions corresponding to the AA residues were encoded as such and the other 10 
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positions (indicated by X) were randomly encoded. The -am represents a C-terminal 
amide, added as part of the peptide synthesis procedure.) The N-terminal residues in 
the D-peptide inhibitors are, for example, ac-GA, ac-KKGA, or ac-KKKKGA. The 
ac- represents an N-terminal acetyl group added as part of the peptide synthesis 
5 procedure. The C-terminal amide and the N-terminal acetyl group are optional 
components of D-peptides of this invention. Other N-terminal residues can be 
included, in place of or in addition to those in the previous sentence, as desired (e.g., 
to increase solubility). For example, D-peptides of the following sequences are also 
the subject of this invention: 

1 0 ac-XXCXXXXXEWXWLCXX-am (SEQ ID NO: 28); 
ac-KKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 29); 
ac-KXKKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 30); 
ac-XXCXXXXXEWXWLCXXX-am (SEQ ID NO: 31); 
ac-KKXXCXXXXXEWXWLCXXX-am (SEQ ID NO: 32); and 

1 5 ac-KKKKXXCXXXXXEWX WLCXXX-am (SEQ ID NO: 33). 

The amino acid residues are represented by the single letter convention and 
X represents any amino acid residue (naturally occurring or non-naturally occurring) 
or other moiety, such as a modified amino acid residue. 

Further, the ten amino acid residue "core" (the 10-mer which is flanked at 

20 each end by a cysteine residue) of the 12 amino acid residue peptide, as well as 
portions, modifications and variants of the 10-mers are also useful to inhibit 
membrane fusion and entry of HIV into cells. Variants, portions and modifications 
of these peptides are also useful as inhibitors. As described further herein, D- 
peptides which comprise a consensus sequence (e.g., WXWL (SEQ ID NO: 23), 

25 EWXWL (SEQ ID NO: 24), CXXXXXEWXWLC (SEQ ID NO: 12) or a portion 
thereof) have been shown to bind the N-helix coiled-coil and are useful to inhibit 
membrane fusion and entry of HIV into cells. The enantiomeric peptides (D- 
peptides) do not serve as efficient substrates for enzymes, such as proteases and, 
therefore, are more resistant to proteolytic degradation than are L-peptides; they are 

30 also less immunogenic than are L-peptides. 



Specific embodiments of D-peptides of the present invention are: 

(a) CDLKAKEWFWLC (SEQ ID NO: 3); 

(b) CEARHREWAWLC (SEQ ID NO: 4); 

(c) CELLGWEWAWLC (SEQ ID NO: 5); 

(d) CLLRAPEWGWLC (SEQ ID NO: 6); 

(e) CSRSQPEWEWLC (SEQ ID NO: 7); 

(f) CGLGQEEWFWLC (SEQ ID NO: 8); 

(g) CMRGEWEWSWLC (SEQ ID NO: 9); 

(h) CPPLNKEWAWLC (SEQ ID NO: 1 0); 

(i) CVLKAKEWFWLC (SEQ ID NO: 1 1); 

(j) KKGACGLGQEEWFWLC (SEQ ID NO: 1 5); 

(k) KKGACELLGWEWAWLC (SEQ ID NO: 1 6); 

(1) KKKKG ACELLGWEWAWLC (SEQ ID NO: 1 7); 

(m) KKGACMRGEWEWS WLC (SEQ ID NO: 1 8); 

(n) KKGACPPLNKEWAWLC (SEQ ID NO: 19); 

(o) a D-peptide comprising WXWL (SEQ ID NO: 23); 

(p) a D-peptide comprising EWX WL (SEQ ID NO : 24); 

(q) a D-peptide comprising CXXXXXEWXWL (SEQ ID NO: 12) 

(r) ac-GACEARHREWAWLCAA-am (SEQ ID NO: 34); 

(r) ac-KKGACEARHREWAWLCAA-am (SEQ ID NO: 38); 

(t) ac-KKKKGACEARHREWAWLCAA-am (SEQ ID NO: 43); 

(u) ac-GACGLGQEEWFWLCAA-am (SEQ ID NO: 44); 

(v) ac-KKGACGLGQEEWFWLCAA-am (SEQ ID NO: 1 5); 

(w) ac-KKKKGACGLGQEEWFWLCAA-am (SEQ ID NO: 45) 

(x) ac-GACDLKAKEWFWLCAA-am (SEQ ID NO: 35); 

(y) ac-KKGACDLKAKEWFWLCAA-am (SEQ ID NO: 39); 

(z) ac-KKKKGACDLKAKEWFWLCAA-am (SEQ ID NO: 46); 

(a') ac-GACELLGWEWAWLCC-am (SEQ ID NO: 47); 

(b') ac-KKGACELLGWEWAWLCAA-am (SEQ ID NO: 1 6); 

(c') ac-KKKKGACELLGWEWAWLCAA-am (SEQ ID NO: 1 7); 

(d') ac-GACSRSQPEWEWLCAA-am (SEQ ID NO: 36); 
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(e') ac-KKGACSRSQPEWEWLCAA-am (SEQ ID NO: 40); 

(f ) ac-KKKKGACSRSQPEWEWLCAA-am (SEQ ID NO: 48); 

(g') ac-GACLLRAPEWGWLCAA-am (SEQ ID NO: 37); 

(h') ac-KKGACLLRAPEWGWLCAA-am (SEQ ID NO: 41); 

(i') ac-KKKKGACLLRAPEWGWLCAA-am (SEQ ID NO: 49); 

0 ') . ac-GACMRGEWEWSWLCAA-am (SEQ ID NO: 50); 

(k') ac-KKGACMRGEWEWSWLCAA-am (SEQ ID NO: 1 8); 

00 ac-KKKKGACMRGEWEWSWLCAA-am (SEQ ID NO: 51); 

(m0 ac-GACPPLNKEWAWLCAA-am (SEQ ID NO: 52); 

(n') ac-KKGACPPLNKEWAWLCAA-am (SEQ ID NO: 19); 

(o0 ac-KKKKGACPPLNKEWAWLCAA-am (SEQ ID NO: 53); 

(p0 ac-GACXXXXXEWXWLCAA-am (SEQ ID NO: 54); 

(q0 ac-KKGACXXXXXEWXWLCAA-am (SEQ ID NO: 55); 

(r0 ac-KKKKGACXXXXXEWXWLCAA-am (SEQ ID NO: 56); 

(s') ac-XXCXXXXXEWXWLCXX-am (SEQ ID NO: 57); 

(t0 ac-KKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 58); 

(uO ac-KKKKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 59); 

(vO ac-XXCXXXXXEWXWLCXXX-am (SEQ ID NO: 60); 

(wO ac-KKXXCXXXXXEWXWLCXXX-am (SEQ ID NO: 61); 

(xO ac-KKKKXXCXXXXXEWXWLCXXX-am (SEQ ID NO: 62); and 

(yO a variant of a sequence of (a) through (x0, wherein the variant binds 

the N-helix coiled-coil cavity of HIV gp41, wherein ac- at the C- 

terminus and -am at the N-terminus are optional. 



D-peptides described herein, which are ligands shown to bind the N-helix 
25 pocket, are also useful in drug screens to identify compounds or molecules (e.g., 
from chemical libraries, recombinantly produced products, naturally-occurring 
substances, culture media or supernatants) which bind the N-helix pocket and thus, 
are also inhibitors of HIV. For example, a competitive assay can be carried out by 
combining a D-peptide which binds the N-helix cavity (e.g., a D-peptide described 
30 herein); IQN1 7 (e.g., in the natural L-handedness), or another fusion protein which 
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is a soluble model that presents the N-helix cavity; and a candidate inhibitor (a 
compound or molecule to be assessed for its ability to bind the N-helix cavity). For 
example, D10pep5 or DlOpepl, IQN17, and a candidate inhibitor (candidate drug) 
can be combined using buffer conditions and peptide concentrations appropriate for 
5 binding of D 1 0pep5 or D 1 Opepl to IQN1 7. The extent to which binding of the D- 
peptide occurs is determined and compared to the extent to which binding occurs 
under the same conditions, but in the absence of a compound or molecule (referred 
to as a candidate drug or candidate inhibitor) to be assessed for its ability to bind the 
N-helix coiled-coil cavity of HIV gp41 envelope protein (in a control). If binding of 

10 DIOpepS or DlOpepl occurs to a lesser extent in the presence of the candidate 

inhibitor (test sample) than in its absence (control sample), the candidate inhibitor is 
a ligand which binds the N-helix coiled-coil cavity and, thus, is an inhibitor. 
Inhibitors identified in this manner can be further assessed for their activity in viral 
infectivity assays and synctia formation assays, such as those described herein. 

1 5 Those inhibitors which show activity in such assays can be further assessed in an 
appropriate animal model or in humans. 

Any method by which binding of the D-peptide, known to bind the N-helix 
cavity, can be detected can be used to assess whether the candidate inhibitor 
interferes with binding. For example, the D-peptide can be detectably labeled and 

20 the extent to which the label appears on the N-helix cavity (as a result of binding of 
the D-peptide) detected, in the presence and in the absence of the candidate 
inhibitor. If less label appears on the N-helix cavity of IQN17 (or other appropriate 
fusion protein) in the presence of the candidate inhibitor (in the test sample) than in 
its absence (in the control sample), then the candidate inhibitor is a ligand which 

25 binds the N-helix cavity (and interferes with binding of the D-peptide). 

Alternatively, the D-peptide (e.g., DIOpepS or DlOpepl) and IQN17 can be labeled 
with a fluorophore (e.g., with EDANS; S-^'aminoethyOaminonaphthalene-l- 
sulfonic acid) with an appropriate quencher that quenches the fluorescent signal of 
the fluorophore when it is in close proximity (e.g., DABCYL; 4-(4'- 

30 dimethylaminophenylazo)benzoic acid). If the candidate inhibitor binds the N-helix 
cavity of IQN17, fluorescence is observed, since, as a result of binding of the 
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candidate inhibitor, the D-peptide is not brought into sufficiently close proximity to 
the quencher to permit it to quench the reporter signal. Alternatively, the fluorescent 
reporter molecule can be on the IQN17 and an appropriate quencher on the D- 
peptide. In either case, the position of the reporter or quencher on IQN1 7 must be 

5 such that when the D-peptide binds the N-helix cavity, the reporter and quencher 
moieties are in sufficiently close proximity to each other that quenching occurs 
(Tyagi, S., et al y Nature Biotechnology 16:49 (1998)). 

Also the subject of this invention are drugs (compounds and molecules) 
which bind the N-helix coiled-coil pocket of HTV gp41 and inhibit (partially or 

10 totally) HIV entry into cells. In one embodiment, these drugs can be identified as 
described herein or by other methods. Drugs which bind the N-helix coiled-coil 
pocket of HIV gp41 are useful as therapeutic agents (to prevent HIV entry into cells 
or reduce the extent to which it occurs), as research tools (e.g., to study the 
mechanism of HIV gp41 function) and to assess the rate of viral clearance by an 

15 individual (e.g., in an animal model or an infected human). 

Also the subject of this invention are compositions, useful in methods of 
interfering with entry of HIV into a mucosal cell; these compositions comprise an 
appropriate carrier or base and at least one component selected from the group 
consisting of: 



20 


(a) 


C34 peptide; 




(b) 


DP178; 




(c) 


T649; 




(d) 


T1249; 




(e) 


a derivative of (a) - (d); 


25 


(0 


a D-peptide which binds to the hydrophobic pocket of HIV gp41; 




(g) 


a derivative of (f); 




(h) 

0) 


a combination of two or more of (a)-(g); and 

a molecule that inhibits HTV infectivity by binding to the N-helix 



coiled coil. 



30 The compositions can comprise one such component or two or more components. 
A further subject of this invention are compositions (e.g., proteins or 
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proteinaceous materials) that can be used to elicit an immune response (e.g., 
antibody production) that will protect (partially or totally) against HIV infection 
and/or disease. Such compositions are useful as protective agents (e.g., vaccines) 
and to obtain antibodies (monoclonal and/or polyclonal) that are useful as research 
5 tools, diagnostic tools, drug screening reagents, and to assess viral dynamics (rates 
of production and clearance of virus) in animal models or infected humans. 

Also the subject of this invention is a list of atomic coordinates for the X-ray 
crystal structure of the complex between IQN17 and DlOpepl. Also the subject of 
this invention is a list of coordinates for the X-ray crystal structure of IQN17. These 

10 coordinates can be used (e.g., as an electronic file for computer graphics programs) 
to create a model of the complex which indicates how DlOpepl binds to the N-helix 
coiled-coil cavity and models of the N-helix coiled-coil cavity. Such models can be 
used, in methods known to those of skill in the art such as in computer graphics 
modeling, to build new models to evaluate the likelihood of binding to the N-helix 

15 coiled-coil cavity by other peptides, peptidomimetics, small molecules, drugs or 
other compounds. Such models can also be used to build new models for the 
structures of molecules (peptides, peptidomimetics, small organic molecules, drugs' 
or other compounds) that bind the N-helix coiled-coil cavity (e.g., H. Kubinyi, Curr. 
Op. DrugDiscov. Develop., 7:16 (1998); P.L. Wood, ibid, 7:34 (1998); J.R. 

20 Morphy, ibid, 7:59 (1998)). These models and the corresponding lists of atomic 
coordinates can be used to identify, evaluate, discover and design more effective 
and/or new D-peptides, L-peptides, peptidomimetics, other small molecules or drugs 
that inhibit HIV infectivity, using methods known to those of skill in the art. A 
further subject of this invention is a method of producing or identifying a drug 

25 which fits (packs into, binds) the N-helix coiled-coil pocket of HIV gp41 through 
the use of atomic coordinates of a crystal, such as a crystal of a soluble, trimeric 
peptide model of the HIV gp41 hydrophobic pocket described herein (e.g., IQN17 or 
a variant thereof), a crystal of such a model in complex with a D-peptide (e.g., 
IQN17 or a variant thereof in complex with a D-peptide described herein, such as 

30 DlOpepl) or a crystal of the N-peptide region of HIV gp41 comprising the amino 
acid residues which comprise the pocket of the N-helix coiled-coil of HIV gp41. 
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The method comprises obtaining a crystal of the soluble model, such as the empty 
soluble model (not in complex with a D-peptide), obtaining the atomic coordinates 
of the crystal (e.g., of the crystal of the empty soluble model, such as IQN17); using 
the atomic coordinates obtained to define the N-helix coiled-coil pocket of HIV 
5 gp41 ; identifying a molecule or compound which fits the N-helix coiled-coil pocket 
and obtaining the molecule or compound; contacting the molecule or compound 
with the N-helix coiled-coil pocket (e.g., by contacting it with a polypeptide which 
comprises the pocket (e.g., IQN17 or a variant thereof or the N-peptide) to assess 
(determine) the ability of the molecule or compound to fit the pocket of HIV gp41, 

10 wherein in the molecule or compound fits the pocket, it is a drug which fits the N- 
helix coiled-coil pocket, whereby a drug which fits the pocket is produced. The 
atomic coordinates of the crystal can be obtained by X-ray diffraction studies or 
form a computer file or Protein Data Base (PDB), such as the PDB presented herein 
for IQN17 (Figures 1 1 A-l IV). 

15 Similarly, the method can be carried out using a crystal of a soluble trimeric 

model in complex with a D-peptide (e.g., a D-peptide described herein, such as 
D 1 Opep 1 ) or a crystal of the N-peptide region of HIV gp4 1 which comprises the 
pocket of the N-helix coiled coil. 

Drugs produces in this manner can be further assessed to conform their 

20 ability to fit into the pocket (e.g., by NMR) and can be assessed for their ability to 
inhibit HIV entry into cells (e.g., by a syncytia assay or infectivity assay). 

The teachings and entire contents of all documents cited herein are expressly 
incorporated by reference into this application. 

BRIEF DESCRIPTION OF THE DRAWINGS 
25 Figure 1 is a schematic of HTV-1 gp41 showing the N36 

(SGIVQQQNNLLRAIEQQHLLQLTVWGDCQLQARIL) (SEQ ID NO: 13) and C34 
(WMEWDREINNYTSLIHSLIEESQNQQEKNEQELL) (SEQ ID NO: 14) peptides 
located within two regions containing 4,3 hydrophobic heptad repeats (labeled 
heptad repeat 1 and heptad repeat 2, also referred to as N-peptide region and C- 
30 peptide region, respectively). The underlined residues in C34 were mutated in this 
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study. Three of these residues (W, W and I) project into the N36 cavity, whereas 
two of these residues (M and R) do not. FP, fusion peptide; S-S, disulfide bond; 
TM, transmembrane region; INTRA, intraviral region. 

Figure 2 is a graph showing the correlation of C34 inhibitory potency with 
5 N36/C34 stability. C34 peptide variants containing substitutions at position Trp 631 
were tested for inhibition of viral entry (filled circles) and cell-cell fusion (open 
circles). IC 50 values are plotted on a logarithmic scale against the Tm (melting 
temperature) of the corresponding N36/C34 complex. The identities and chemical 
structures of the substitutions are drawn under the corresponding data points. In 

1 0 order of increasing hydrophobic bulk, the substitutions were: glycine (Gly), alanine 
(Ala), L-a-aminobutyric acid (Abu), valine (Val), leucine (Leu), phenylalanine 
(Phe), the wildtype residue tryptophan (Trp), and L-p-(l-naphthyl) alanine (Nal). 
Error bars indicate the standard error from triplicate experiments. 

Figure 3 shows the amino acid sequences of D-peptides (SEQ ID NOS: 34, 

15 38, 15, 35, 16, 17, 36, 40, 41, 18 and 19) and the consensus sequence (SEQ ID NO.: 
12). As represented, each peptide is flanked by GA on the N-terminus and AA on 
the C-terminus, and comprises a blocking group at the N-terminus: (Acetyl-GA-C- 
1 Omer-C- AA-CONH 2 ; this can also be represented as ac-GA-C-lOmer-C-AA-am). 
The single letter conventions which are used to represent amino acid residues are as 

20 follows: G=glycine; A=alanine; C=cysteine; D=aspartic acid; L=leucine; K=lysine; 
E=glutamic acid; W=tryptophan; F=phenylalanine; R=arginine; H=histidine; 
S=serine; and Q=glutamine. 

Figure 4 is a schematic representation of mirror-image phage display with 
the D-IQN17 target, in which: (1) rounds of phage selection are carried out to 

25 identify binders to D-IQN1 7; (2) individual clones are sequenced; (3) binding 

specificity is assessed by determining whether the phage bind to the gp41 region of 
D-IQN17; (4) D-peptides of those phage sequences which bind are produced; and 
(5) the anti-HIV activity of the D-peptides is assayed. 

Figures 5 A and 5B show the crystal structure of IQN17 bound to DIOpepl. 

30 IQN17 is shown to be a continuous three-stranded coil, and binding of the conserved 
amino acid residues of DIOpepl is shown to be to the hydrophobic pocket of IQN17, 
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formed by the 17 residues derived from HIV gp41. Figure 5 A shows IQN17, 
consisting of GCN4-pI Q I residues fused to H1V-1 gp41 residues and the binding of 
DlOpepl to the hydrophobic pocket of IQN17 (area within box). The D-peptide 
which binds to the pocket is represented by the branched extensions (i.e., stick 
5 representation). Figure 5B is an enlargement of the area within the box and shows 
the conserved residues that pack into the pocket (Trp, Trp Leu) as well as a glutamic 
acid (Glu). 

Figures 6A and 6B show results of syncytia assays, using the D-peptides 
described herein. Figure 6A is a graphic representation of results of syncytia assays. 
10 Figure 6B represents IC 50 data for D-peptides, with results from one or more 
experiments. 

Figures 7A-7N are the PDB file which lists the atomic coordinates for the 
crystal structure of DlOpepl bound to IQN17, in which residues 0-28 of the A chain 
are derived from the GCN4-pI Q I sequence (with three mutations), residues 29-45 of 

15 the A chain are derived from the HIVgp41 sequence, residues 0-16 of the D chain 
represent the D-peptide, ordered water molecules are represented as W, and a bound 
chloride ion as chain I. Residue 0 represents the acetyl group. The PDB file 
represents a monomer; the trimer is formed by crystallographic symmetry. 
Figures 8 A and 8B show results of assessment of inhibition of HTV-1 

20 membrane fusion by a D-peptide. Figure 8A shows results of syncytia assay with no 
D-peptide. Figure 8B shows results of syncytia assay with D-peptide. 

Figures 9A-9C show results of ! H NMR experiments characterizing the 
aromatic residues of IQN17/D-peptide complexes. Figure 9 A shows 1D-NMR 
spectra of DlOpepl a (top), IQN17 (middle) and a 1:1 complex of DlOpepl a and 

25 IQN17 (bottom). The x-axis is the same as for (C) below. Upfield peaks assigned 
to the four scalar-coupled aromatic ring protons of Trp-571 are indicated. The 
unmarked upfield peak of the bottom trace corresponds to an unassigned Ha 
" resonance. Figure 9B shows 1 D spectra of 1 : 1 complexes between IQN 1 7 and each 
D-peptide (as labeled). The same four protons are indicated in some spectra. Figure 

30 9C shows a 2D-NMR TOCSY spectrum of IQN17/D10pepla complex. Cross-peaks 
linking these four tryptophan protons are indicated, along with specific assignments. 
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The TOCSY mixing time was 42ms. 

Figure 10 shows the conformation of the DlOpepl peptide as in the complex 
with IQN1 7, as determined by X-ray crystallography. 

Figures 1 1 A ? 1 IV are the PDB file which lists the atomic coordinates for 
5 the crystal structure of IQN1 7, in which residues 0-28 of the A, B and C chains of 
the IQN17 trimer are derived from GCN4-pIqI sequence (with three mutations), 
residues 29-45 of the chains A, B, and C are derived from HIV gp41, ordered water 
molecules are represented as W, and a bound chloride ion as chain I. The PDB file 
represents a whole trimer in the crystallographic asymmetric unit. 

1 0 DETAILED DESCRIPTION OF THE INVENTION 

The gp41 subunit of the HIV-1 envelope protein mediates fusion of viral and 
cellular membranes. The crystal structure of the gp41 ectodomain core is a six-helix 
bundle composed of three helical hairpins, each consisting of an N-helix paired with 
an antiparallel C-helix (D. C. Chan, D. Fass, J. M. Berger, P. S. Kim, Cell. 59:263 

15 (1997), 

W. Weissenhorn, A. Dessen, S.C. Harrison, J. J. Skehel, D. C. Wiley, Nature, 
387:426 (1997); K. Tan, J. Liu, J. Wang, S. Shen, M. Lu, Proc. Nail Acad. Set 
USA, 94:12303 (1997). Three N-helices form an interior, trimeric coiled-coil, and 
three C-helices wrap around the outside of this N-helix coiled-coil along conserved, 

20 hydrophobic grooves. This structure likely corresponds to the core of the fusion- 
active state of gp41 (discussed in D. C. Chan, D. Fass, J. M. Berger, P. S. Kim, Cell, 
59:263 (1997), and D.C. Chan and Peter S. Kim, Cell, 93:681 (1998)) and shows 
similarity to the proposed fusogenic structures of envelope fusion proteins from 
influenza (P. A. Bullough, F/M. Hughson, J. J. Skehel, D. C. Wiley, Nature, 371:31 

25 (1994)), Moloney Murine Leukemia Virus (D. Fass, S. C. Harrison, P. S. Kim, Nat. 
Struct Biol, 5:465 (1996)), and simian immunodeficiency virus (SIV). (V.N. 
Malashkevich, D.C. Chan, C.T. Chutkowski, P.S. Kim, Proc. Natl Acad. Sci. USA, 
95:9134 (1998), M. Caffrey et al, EMBOJ., 77:4572 (1998)), and Ebola virus (W. 
Weissenhorn et al, Mol Cell 2:605 (1998), V. N. Malashkevich et al, Proc. Natl 

30 Acad. Sci. USA, 96:2662 (1999).) 

SUBST^ (RULE 28) 
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Synthetic C-peptides (peptides corresponding to the C-helix), such as DP 178 
and C34, are potent inhibitors of HTV-1 membrane fusion and are effective against 
both laboratory-adapted strains and primary isolates (V. N. Malashkevich, D. C. 
Chan, C. T. Chutkowski, P. S. Kim, Proc. Natl Acad. ScL USA, 95:9134 (1998), 
5 DP 178 corresponds to residues 638-673 of HIV- 1 gp41 and is acetylated at the 
amino terminus and amidated at the carboxy terminus (C. T. Wild, D. C. Shugars, 
T. K. Greenwell, C. B. McDanal, T. J. Matthews, Proc. Natl Acad. ScL USA, 
97:9770 (1994), S. Jiang, K. Lin, N. Strick, A.R. Neurath, Nature, 365:1 13 (1993)). 
A Phase I clinical trial with the C-peptide DP 178 (also called T-20) indicates that it 

10 has antiviral activity in vivo, resulting in reduced viral loads (M. Saag, et al 9 abstract 
#771 presented at the Infectious Disease Society of America 35 th Annual Meeting, 
San Francisco, CA, 16 September 1997; Kilby, J.M. et al Nature Med. 4:1302-1307 
(1998)). Based on the structural features of the gp41 core, these peptides are thought 
to act through a dominant-negative mechanism, in which exogenous C-peptides bind 

15 to the central coiled-coil of gp41 and lead to its inactivation (D.C. Chan and P.S. 
Kim, Cell 93:681 (1998); R.A. Furuta et al, Nat. Struct. Biol, 5:276 (1998); D. C 
Chan, D. Fass, J. M. Berger, P. S. Kim, Cell, 59:263 (1997), W. Weissenhorn, A. 
Dessen, S,C. Harrison, J. J. Skehel, D. C Wiley, Nature, 387:426 (1997); K. Tan, J. 
Liu, J. Wang, S. Shen, M. Lu, Proc. Natl. Acad. Sci. USA, 94:12303 (1997), M. Lu, 

20 S. C Blacklow, P. S. Kim, Nat. Struct. Biol, 2:1075(1995) and C. H. Chen, T. J. 
Matthews, C. B. McDanal, D. P. Bolognesi, M. L. Greenberg, J. Virol, 69:3771 
(1995)). These peptides likely act on a pre-hairpin intermediate of gp41 that forms 
when the native gp41 structure (i.e., the nonfusogenic conformation present on free 
virions) is perturbed by gpl20/CD4/coreceptor interactions. This pre-hairpin 

25 intermediate is proposed to have an exposed N-coiled-coil, thereby allowing C- 
peptides to bind and inactivate gp41 prior to the formation of the fusion-active 
hairpin structure (D. C. Chan, P. S. Kim, Cell, 93:681 (1998)). This model is further 
supported by immunoprecipitation experiments indicating that the C-peptide DP 178 
binds to gp41 (R. A. Furuta, C. T. Wild, Y. Weng, C. D. Weiss, Nat. Struct. Biol, 

30 5:276 (1998)). In addition, viruses escaping DPI 78 inhibition show mutations in the 
central coiled-coil region of gp41 (L. T. Rimsky, D. C. Shugars, T. J. Matthews, J. 
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ViroL, 72:986(1998)). 

Recent crystallographic studies of gp41 facilitate the development of small- 
molecule peptidomimetic drugs which, in contrast to C-peptides, have the potential 
to be orally administered. Within each coiled-coil interface is a deep cavity, formed 

5 by a cluster of residues in the N-helix coiled-coil, that is an attractive target for the 
development of antiviral compounds. Three residues from the C-helix (Trp 628 , 
Trp 631 , and He 635 ) insert into this cavity and make extensive hydrophobic contacts. 
Mutational analysis indicates that two of the N-helix residues (Leu 568 and Trp 571 ) 
comprising this cavity are critical for membrane fusion activity (J. Cao, et al, X 

10 virol, 67:21 Al (1993)). Therefore, it is reasonable to expect that compounds that 
bind with high affinity to this cavity and prevent normal N- and C-helix pairing will 
be effective HIV-1 inhibitors. In addition, residues in the cavity are highly 
conserved among diverse HIV-1 isolates. Because of the high structural 
conservation, drugs targeting this site would have broad activity against diverse 

15 HIV-1 isolates, and possibly HIV-2 isolates. 

Although this hypothesis is tempting, until now, it had not been 
demonstrated that these cavity contacts are important for the potency of the C34 
inhibitor. In fact, some C-peptides that lack the cavity-binding residues, such as 
DP 178 (C. T. Wild, D. C. Shugars, T. K. Greenwell, C. B. McDanal, T. J. Matthews, 

20 ibid, 97:9770 (1994); Kilby, J.M. et al, Nature Med., 4:1302 (1998)), are highly 
effective inhibitors of HIV-1 membrane fusion. These concerns emphasize the need 
for systematic structure- function analysis to identify determinants of C-peptide 
activity. 

To determine the role of cavity-contacts in inhibitory activity, structure- 
25 based mutagenesis was performed on C34. The core of the gp41 ectodomain (Figure 

1) was reconstituted with two synthetic peptides called N36 and C34 (M. Lu, P. S. 

Kim, J. Biomol Struct. Dyn. t 75:465 (1997), D. C. Chan, D. Fass, J. M. Berger, P. S. 

Kim, Cell, 59:263 (1997)). Variants of the C34 peptide with single alanine 

substitutions were synthesized, and the helical content and thermal stability of 
30 mutant N36/C34 complexes were quantitated by circular dichroism. As expected, 

mutation of C34 residues (Met 629 , Arg 633 ) that do not contact the N36 coiled-coil had 
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little effect on mean residue ellipticity at 222 nm (a measure of helical content) or 
stability of N36/C34 complexes (Table 1). However, mutation of the three residues 
(Trp 628 -- Ala, Trp 63, -+ Ala or lie 635 -. Ala) that project into the N36 coiled-coil cavity 
resulted in N36/C34 complexes with substantially decreased mean ellipticity and 
5 stability (Table 1). The greatest destabilization was observed with the mutant Trp 631 
-►Ala, which formed N36/C34 complexes with an apparent melting temperature (T m ) 
of 37 °C, compared to 66 °C for wildtype. These results demonstrate that C34 
residues making hydrophobic contacts with the N36 coiled-coil cavity are important 
for stabilizing the helical-hairpin structure of the gp41 ectodomain core. 

10 To determine the importance of these residues in the ability of C34 to inhibit 

membrane fusion, the activity of C34 peptides was tested in HTV-1 viral entry and 
syncytium assays (Table 1). Mutations (Met 629 -* Ala and Arg 633 ->Ala) that had little 
effect on the stability of the N36/C34 complex also had little effect on the inhibitory 
activity of wildtype C34 (IC 50 ~2. 1 nM and -0.55 nM for viral entry and syncytium 

15 formation, respectively). However, mutation of the strictly conserved Trp 628 or 

Trp 631 to alanine resulted in a substantial decrease in activity of~5 fold and -30-fold, 
respectively (Table 1). Mutation of the less well-conserved He 635 resulted in only a 
-2-fo.ld decrease in inhibitory activity. These results demonstrate for the first time, 
the C34 residues which make contact with gp41 pocket are important for the 

20 inhibitory potency of C34. 

The relationship between the potency of mutant C34 peptides and the 
stability of mutant N36/C34 complexes was clarified by taking advantage of the 
greatly destabilizing effect of the Trp 631 mutation to construct a series of N36/C34 
complexes with a gradation of stabilities. The Tip 631 position was used as a "guest 

25 site" and the tryptophan was substituted with natural and artificial amino acids 

representing a broad range of hydrophobic bulk. In order of increasing hydrophobic 
bulk, the substitutions were: glycine (Gly), alanine (Ala), L-a-aminobutyric acid 
(Abu), valine (Val), leucine (Leu), phenylalanine (Phe), the wildtype residue 
tryptophan (Trp), and L-P-(l-naphthyl) alanine (Nal). This approach resulted in a 

30 set of C34 peptides that form N36/C34 complexes with T m s ranging from 37 °C to 
66°C. The T m s and [8] 222 (10 3 deg cm 2 dmol* 1 ) for the N36/C34 variants (with IC 50 
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values (nanomolar) for virus entry and cell fusion, respectively, in parentheses) are: 
Trp 63, -Gly,35°C, 17.1 (38 ± 6.1, 25 ± 3.8); Trp 63, -Ala, 37°C, -24.9 (40 ±4.3, 15 ± 
.0.8); Tip°'-Abu, 43°C; -23.2 (16 ± 4.8, 6.9 ± 0.4); Trp 63, -Val, 43°C, -23.9 (13 ± 
2.8, 4.5 ± 0.09); Trp 63, -Leu, 50°C, -26.7 (5.3 ± 1.0, 3.2 db 0.1); Trp 63, -Phe, 59°C, - 
5 26.3 (3.6 ± 0.8, 1.6 ± 0.05); wildtype, 66°C, -31.7 (1.5 ± 0.2, 0.55 ± 0.03); 
Trp 0, -Nal, 62°C, -32.0 (1.4 ± 0.3, 0.79 ± 0.08). The concentration of the 
Trp^-Nal peptide was measured by Nal absorbance using the extinction coefficient 
e = 6900 at 282 nm (J. Blake, C. H. Li, J. Med. Chem., 75:423-426 (1975)). In 
HIV-1 infection and syncytium assays, this series of peptides showed potencies that 

1 0 closely correlated with the T m of the corresponding N36/C34 complex (Figure 2). 
The potency order of these mutants is wt~Nal>Phe>Leu>Val~Abu>Ala~Gly, in 
close agreement with the hydrophobic bulk of the substitution and the stability of 
N36/C34 complexes. There is a striking linear relationship when the IC 50 is plotted 
on a logarithimic scale as a function of the Tm (Figure 2). Since AG= -RTInK (AG, 

15 change in free energy; R, gas constant; T, absolute temperature; and K, equilibrium 
constant) and AT m (T m< wildlypc compIex -T m , mutam complw ) is proportional to A(AG) (AG 
wiidtypec^picx-AG mutant complcx ) (W. J. Becktel, J.A. Schellman, Biopolymers, 26:1859 
(1987)), the observed linear relationship strongly suggests that the potency of the 
C34 variants is directly related to their affinity for the N-helix coiled-coil, as 

20 predicted by a dominant-negative mode of inhibition. These results provide strong 
support for the proposal that the coiled-coil cavity in the gp41 core is an attractive 
drug target. Conserved residues projecting into the hydrophobic cavity clearly play 
a major role in the ability of C34 to inhibit HIV-1 infection, indicating that this 
inhibitor works by forming a high-affinity complex with the N-helix coiled-coil. 

25 Moving beyond traditional peptides, mirror-image phage display techniques (T. N. 
Schumacher, et al. Science, 277:1854 (1996)), selection-reflection aptamer 
techniques (K.P. Williams et a/., PNAS, 94: 1 1285 (1997); S. Klupmann et al, Nat 
Biotech., 4:1112 (1996); A. Nolte et al, Nat Biotech., 74:1116 (1996), 
combinatorial chemistry (A. Borchardt, S. D. Liberies, S. R. Biggar, G.R. Crabtree, 

30 S.L. Schreiber, Chem. Biol. 4:961 (1997); J.C. Chabala, Curr. Opin. Biotechnol, 
6:632 (1995)) and computational approaches in structure-based drug design (H. 
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Kubinyi, Curr. Opin. Drug Discov. Develop., 7:16 (1998)), can be used to identify 
D-peptides, peptidomimetics, and small molecules that bind with high affinity to the 
coiled-coil cavity. The close correlation between N36/C34 stability and C34 
inhibitory potency suggests that the effectiveness of such compounds will depend 
5 critically on the strength of their cavity-contacts. These results suggest that 
candidate compounds can be tested for the ability to form a stable complex with 
N36, thereby providing a basis for rapid, quantitative screens to identify and 
evaluate potential inhibitors of HIV- 1 entry. 

Small-molecule inhibitors directed against the cavity of the central coiled- 

10 coil target one of the most highly conserved regions of the HTV-1 envelope proteins. 
The analogous cavity in the SIV gp41 core has an essentially identical structure, 
with conservation of side chain conformations (V. N. Malashkevich, D. C. Chan, C. 
T. Chutkowski, P. S. Kim, Proc. Natl. Acad. ScL USA, 95:9134 (1998)). This high 
degree of structural conservation explains the broad neutralizing activity of C- 

15 peptides, which are effective against laboratory-adapted strains as well as primary 
isolates (C. T. Wild, D. C. Shugars, T. K. Greenwell, C. B. McDanal, T. J. 
Matthews, Proc. Natl. Acad. Sci. USA, 97:9770 (1994), S. Jiang, K. Lin, N. Strick, 
A.R, Neurath, Nature, 365:113 (1993)). Remarkably, SIV C34 peptide is nearly as 
effective as HIV-1 C34 in inhibiting HIV-1 infection (V. N. Malashkevich, D. C. 

20 Chan, C. T. Chutkowski, P. S. Kim, Proc. Natl. Acad. ScL USA, 95:9134 (1998)). 
In addition, a C-peptide (T649) containing the cavity-binding region is much less 
susceptible to the evolution of resistant virus (L. T. Rimsky, D. C. Shugars, T. J. 
Matthews, J. Virol., 72:986 (1998)) than DP 178 (also called T-20), which lacks this 
region. These observations are evidence that high-affinity ligands targeting the 

25 coiled-coil surface, particularly its cavity, will have broad activity against diverse 
HIV isolates (including HIV-2) and will be less likely to be bypassed by drug-escape 
mutants. 

These studies on the mechanism of C-peptide action also support the 
hypothesis that the trimeric hairpin structure of the gp41 core (Chan, D.C. et al. 9 
30 Cell, 89:263 (1997); Weissenhorn, W. et al, Nature, 357:426 (1997); Tan, K. et al. y 
Proc. Natl. Acad. Sci. USA, 94:12303 (1997)) corresponds to the fusion-active state 
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of gp41. The work described herein shows that the inhibitory potency of C34 
depends on its ability to bind to the N-coiled-coil of gp41 . Since the hairpin 
structure of gp41 is extremely stable (with a melting temperature in excess of 90°C) 
(Lu, M. et ai, Nat Struct. Biol. 2:1075 (1995)), it is unlikely that nanomolar 
5 concentrations of C34 can disrupt this structure once it has formed, especially given 
the high effective concentration of the N- and C-helices within an intact gp41 
molecule. Rather, C-peptides likely act prior to the formation of the gp41 hairpin by 
binding to a transient pre-hairpin intermediate, in which the central coiled-coil is 
exposed. Binding of C-peptides to this pre-hairpin intermediate inactivates gp41 

10 and prevents its conversion to the fusion-active hairpin structure (D. C. Chan, P. S. 
Kim,Ce//. 93:681 (1998)). 

As described herein, the pocket on the surface of the N-helix coiled-coil of 
HIV-1 envelope protein gp41 subunit is a drug target. Similarly, cavities on other 
pathogens (e.g., HIV-2) which can cause AIDS or on pathogens which cause AIDS- 

15 like conditions in nonhuman mammals (e.g., SIV) are also drug targets. As 
described herein, available methods (e.g., mirror image phage display methods, 
combinational chemistry, computational approaches and other drug screening and 
medicinal chemistry methods) can be used to identify peptides, D-peptides, 
peptidomimetics and small molecules that bind the coiled-coil cavity of HIV-1 

20 (and/or HIV-2) with sufficient affinity to interfere with viral entry into cells and, 
thus, inhibit viral infection. As further described herein (Example 3), mirror image 
phage display has been used to identify D-peptides which bind to a cavity on the 
surface of the N-helix coiled-coil of HIV-1 gp41 . 

As a result of the work described herein, screening assays which identify 

25 molecules or compounds (agents or drugs) that prevent C34/N36 complex formation 
and/or disrupt the complex once it has formed are available, as are methods of 
identifying molecules or compounds (agents or drugs) which bind the N-helix 
coiled-coil pocket of HIV gp41 . Such drugs or agents are useful to inhibit (totally or 
partially) HIV entry into cells and, thus, infection by HIV. 

30 Methods of screening for compounds or molecules (referred to as drugs or 

agents) that interfere with formation of a stable complex between C34 and N36 or 
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disrupt a complex between the two and methods of screening for compounds or 
molecules that bind the N-helix coiled-coil pocket of HIV gp41 are a subject of the 
present invention. 

In one embodiment of a screening method of the present invention, drugs 
5 which interfere with formation of a complex between C34 peptide and N36 peptide 
are identified by combining a candidate drug (a compound or molecule to be 
assessed for its ability to interfere with formation of a complex between C34 and 
N36) with C34 and N36, thus forming a test sample, under conditions appropriate 
for formation of a complex between C34 and N36 and determining whether 

1 0 formation of C34/N36 complex is inhibited (partially or totally) in the test sample. 
Results of this assessment can be compared with the results of an appropriate 
control, which is the same combination as the test sample, except that the candidate 
drug is not present; the control is subjected to the same conditions as is the test 
sample. If C34/N36 complex is not formed or is formed to a lesser extent in the 

1 5 presence of the candidate drug (in the test sample) than in its absence, the candidate 
drug is a drug that interferes with formation of a stable complex between C34 and 
N36. Such a drug is also referred to herein as an inhibitor of C34/N36 complex 
formation. Inhibition of complex formation can be assessed by determining the 
extent to which binding of the two members of the complex occurs, such as by 

20 means of a fluorescence assay (e.g., FRET), in which C34 and N36 are each labeled 
by a member of a pair of donor-acceptor molecules or one end of one of the peptides 
(e.g., the N-terminus of C34) is labeled with one member of such a pair (EDANS) 
and the natural fluorophore tryptophan, present in the N36 peptide, is the other 
member of the donor/acceptor pair. Binding of the C34 and N36 is assessed by the 

25 extent to which light emission (FRET) occurs from the acceptor model and/or the 
wavelength spectrum of the light emitted is altered. Prevention of binding by the 
candidate drug alters the extent to which light is emitted and/or prevents the shift in 
wavelength that would occur if binding of C34 and N36 occurred. Alternatively, 
C34 can be labeled with a detectable label, such as a radiolabel (e.g., by synthesizing 

30 a variant C34 with a kinase recognition site that can be labeled with a kinase and 
radioactive ATP). The radiolabeled C34 and the candidate drug are combined with 
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N36 immobilized to, for example, a solid surface (e.g., a bead or a plastic well), thus 
producing a test sample. The extent to which binding of labeled C34 with 
immobilized N36 occurs is determined and compared with the extent to which 
binding of labeled C34 to immobilized N36 occurs under the same conditions to 
5 which the test sample is subjected, but in the absence of the candidate drug (in a 
control sample). Typically, this assessment is carried out after the sample has been 
maintained for sufficient time and under appropriate conditions for C34/N36 binding 
to occur and a subsequent wash to remove any unbound C34 and candidate drug. If 
binding occurs in the test sample to a lesser extent than in the control sample, as 

10 evidenced by less radiolabel bound to the immobilized N36 in the test sample than 
in the control sample, the candidate drug is an inhibitor of binding of C34 and N36. 
Alternatively, the label or tag on C34 can be a member of a binding pair, the other 
member of which is used to detect binding to N36. For example, C34 can be tagged 
with biotin (through standard solid-state peptide synthesis, for example) and 

1 5 combined with N36, which can be in solution or bound to a solid surface, such as a 
bead, well or flat/planar surface, along with the candidate drug (test sample) or in the 
absence or the candidate drug (control sample). Binding of C34 to N36 is assessed 
by detecting the presence of biotin associated with N36, such as through the use of 
labeled streptavidin (e.g., streptavidin - HRP, streptavidin - AP or iodinated 

20 streptavidin), which binds the biotin on C34 and is then itself detected through its 
label. If binding occurs less in the presence of the candidate drug (in the test 
sample) than in the absence of the candidate drug (in the control sample), as 
indicated by the presence of less biotin detected on N36 in the test sample than in 
the control sample, the candidate drug is an inhibitor of C34/N36 binding. The 

25 candidate drugs can be obtained, for example, from a library of synthetic organic 
compounds or random peptide sequences, which can be generated synthetically or 
through recombinant technology. 

In a similar fashion, the ability of a candidate drug to disrupt C34/N36 
binding can be assessed, to identify inhibitors of C34/N36 and, thus, of HIV 

30 infection. In this embodiment, preformed C34/N36 complex is combined with a 
candidate drug, which is to be assessed for its ability to disrupt the complex, thus 
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producing a test sample. The control sample is the same as the test sample, except 
that the control sample does not contain the candidate drug; it is treated in the same 
manner as the test sample. If C34/N36 binding is disrupted in the presence of the 
candidate drug and not in the control sample or if disruption of the complex occurs 
5 to a greater extent in the test sample than in the control sample, the candidate drug is 
an inhibitor (disrupter) of C34/N36. Detection of disruption of binding can be 
carried out as described above for detection of/prevention of/interference with 
binding of C34/N36 (e.g., by FRET or a fluorescence assay, by detecting a 
radiolabel or other detectable label, such as biotin.) 

10 Results described herein demonstrate that hybrids (i.e., fusion proteins) can 

be made between a trimeric version of the coiled-coil region of a protein (such as 
GCN4) and the N-helix coiled-coil of HIV gp41, and that such hybrids are trimeric 
(i.e., not aggregated) and 100% helical. Results described herein also clearly 
indicate that such fusion proteins do not disrupt or alter the structure of the N- 

15 peptide large cavity (i.e., hydrophobic pocket), which is essentially the same in 
IQN17 (ligand-free and in complex with DlOpepl; see Example 5) as it is in N36 
(i.e., in complex with C34; Chan D.C. et al Cell 89, 263 (1997)). 

Figures 5 A, 5B and 6 present results of assessment of peptides described 
herein. In Figure 5A-5B, the IQN17 crystal structure is shown to be a continuous, 

20 three-stranded coiled-coil; the 17 residues derived from HIV gp41 form a 

hydrophobic pocket very similar to that found in the crystal structure of gp41 . As 
shown, DlOpepl is bound to this pocket and the residues of DlOpepl that 
correspond to the conserved residues (leucine, tryptophan, tryptophan) found in all 
of the D-peptide inhibitors described herein are packed into this pocket, clearly 

25 indicating that other D-peptide inhibitors which comprise these conserved residues 
would bind to IQN1 7 in the same manner. Figure 6 shows results of syncytia assays 
carried out according to the method described by Chan et al. (Chan, D. C. et al 
Proc. Natl Acad. ScL t 95: 15613-15617 (1998)). In the experiments whose results 
are represented in Figure 6, D-peptides identified as described herein were used. In 

30 each instance, a blocking group (e.g., an acetyl group) was present at the N terminus 
and a CONH 2 (amide) was present at the C-terminus. Results of these assays 
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showed a range of IC 50 concentrations, where IC 50 is the concentration at which one 
half of the number of syncytia are observed, compared to the control, in which no 
peptide is included. For example, DIOpepS with two lysines at the N-terminus has 
an IC 50 of approximately 6^. 

5 In another embodiment, the invention relates to a method of identifying a 

drug that binds the N-helix coiled-coil cavity of HIV gp41. Here, too, the assay is 
based on assessing loss or decrease in binding, but unlike the C34/N36 complex 
assay described above, which is a more general assay in that it covers or detects 
interaction with any portion of the groove formed by the N-helical region of HIV 

10 gp41, this embodiment focuses on the HIV gp41 hydrophobic pocket (the N-helix 
coiled-coil cavity). In this embodiment, the method comprises combining a 
candidate drug to be assessed for its ability to bind the N-helix coiled-coil cavity of 
HIV gp41 with a fusion protein that comprises a trimeric version of the coiled-coil 
region of a protein and a sufficient portion of the N-peptide of HIV gp41 to include 

1 5 the HIV gp41 cavity, under conditions appropriate for presentation of the HIV gp41 
cavity for binding by a peptide or other molecule and determining (e.g., in a high - 
throughput screen) whether the candidate drug binds the fusion protein. If binding 
occurs, the candidate drug is a "hit" that may be a drug that binds the N-helix coiled- 
coil cavity of HIV gp41 . If binding occurs, the candidate drug has bound the N- 

20 helix coiled coil and it can be determined if it binds to the coiled-coil cavity. Such 
"hits" can then be screened in secondary assays, such as the cell/cell fusion assay 
and HIV infectivity assay to determine if the candidate drug is a drug. Alternatively, 
or in addition, such "hits" can be assessed further by use of a counterscreen with 
other fusion proteins (or peptides), to which pocket-binding molecules will not bind. 

25 For example, GCN4-pIqI (with the same three surface mutations as in IQN1 7) or a 
version of IQN17 with a point mutation in the hydrophobic pocket, IQN17(G39W), 
in which glycine 39 is mutated to tryptophan, resulting in a large protrusion into the 
pocket, can be used in a counterscreen. In this example, a candidate drug that binds 
to IQN17 but not to GCN4-pI Q I (with the same three surface mutations as in IQN17) 

30 or IQN17(G39W) is a drug that binds the N-helix coiled-coil cavity of HIV gp41 . 
In a further embodiment, a competitive assay is carried out. In this 
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embodiment, a peptide or protein that binds the N-helix coiled-coil cavity of HIV 
gp41 is combined with the candidate drug and the fusion protein and whether the 
candidate drug binds the HIV gp41 cavity is determined in the presence of the 
peptide that binds the N-helix coiled cavity of HIV gp41 . If the candidate drug 

5 binds the fusion protein, it is a drug that binds the HIV gp4 1 cavity. For example, a 
fusion protein which comprises a trimeric version of the coiled-coil region of GCN4 
and the C-terminus of the N peptide of HIV gp41 that includes the N-helix coiled- 
coil cavity (IQN17) is combined with a "reference" D-peptide (e.g., any of the D- 
peptides described herein or variants thereof) that binds the N-helix coiled-coil 

10 cavity and a candidate drug to be assessed for its ability to bind the N-helix coiled- 
coil cavity of HIV gp41, thus producing a test sample, which is maintained under 
conditions appropriate for binding of the D-peptide to bind to the cavity. A control 
sample, which includes the same components as the test sample, except for the 
candidate drug, and is handled in the same manner as the test sample, is also 

15 assessed. In both samples, binding of the reference D-peptide is assessed. If 
binding of the reference D-peptide occurs to a lesser extent in the presence of the 
candidate drug (in the test sample) than in its absence (in the control sample), the 
candidate drug is a drug that binds the N-helix coiled-coil cavity of HIV gp41 . 
Detection of binding is assessed, for example, in a similar manner as described 

20 above for the C34/N36 embodiment of the invention. For example, the D-peptide is 
labeled with a detectable label, such as a radiolabel or a first member of a binding 
pair (e.g., biotin), and the extent to which the N-helix coiled-coil cavity bears the 
label (after the samples have been maintained under conditions appropriate for 
binding of the reference D-peptide to the cavity) is determined. In the case in which 

25 radiolabeling is used, the extent to which the fusion protein bears the radiolabel is 
assessed in the test sample and compared with the extent to which the fusion protein 
bears the radiolabel in the control sample. If the detectable label is a first member of 
a binding pair (e.g. biotin), the second member of the pair (a binding partner) is 
added to the samples in order to detect the extent to which the fusion protein is 

30 bound by the reference D-peptide. This can be done directly or indirectly (e.g., by 
adding a molecule, such as an antibody or other moiety which binds the second 
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member of the binding pair). Less of the label will be present on the fusion protein 
(N-helix coiled-coil cavity) if the candidate drug has inhibited (totally or partially) 
binding of the D-peptide to the cavity. If binding occurs to a lesser extent in the test 
sample (in the presence of the candidate drug) than in the control sample (in the 

5 absence of the candidate drug), then the candidate drug is a drug that binds the N- 
helix coiled-coil cavity of HIV gp41. 

IQN17, or a variant thereof, in the D-enantiomer, is useful to identify 
molecules or compounds which are members of a library or collection and bind the 
N-helix coiled-coil of gp41. For example, a library or collection of molecules or 

10 compounds, such as a phage display library, can be screened with IQN1 7 in the D- 
enantiomer to identify members that bind the pocket. This has been carried out 
successfully, as described herein. The mirror image of IQN17, or a variant thereof, 
is used as the target molecule. As used herein, the terms "D-enantiomer of a 
polypeptide" and "D-peptide" refer to the exact mirror image of the molecule in the 

15 natural handedness. Thus, for amino acid residues that contain a second chiral 

center, such as He and Thr, the exact mirror image of the naturally-occurring amino 
acid residue is used to create the D version of the polypeptide. Also as used herein, 
the terms <e D-amino acids" and "L-amino acids" are both meant to include the non- 
chiral amino acid glycine. D-IQN17 can be immobilized to a solid surface, such as 

20 by addition of one member of a binding pair (e.g., biotin) to it and addition of the 
other member of the pair (e.g., streptavidin) to the solid surface. Binding of the two 
members results in immobilization of D-IQN17 on the solid surface, such as for 
phage panning. A linker which is an enzyme recognition site (e.g., an amino acid 
linker such as Gly-Lys-Gly, in which an L-lysine residue is used) can be placed 

25 between the D-IQN17 sequence and the binding pair member (between the biotin 
and D-IQN17) to provide an enzyme recognition site (here, a trypsin recognition 
site), so that bound phage can be eluted by a trypsin digestion, rather than by non- 
specific elution, such as acid addition. The phage display library can be a library of 
L-amino acid peptides of any appropriate length fused to an appropriate phage gene. 

30 In one embodiment, it is a phage display library of L-amino acid peptides fused to 
the gfflgene of Ml 3 phage. The peptides, in one embodiment, comprise 10 
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randomly encoded amino acid residues flanked by either a cysteine or a serine on 
both sides. Typically, several rounds of panning are carried out. D-IQN17-specific 
binding phage are identified. Phage that bind only the gp41 region of D-IQN17 can 
be identified by post-panning assessment, such as by screening against wells that 
5 lack the antigen and then further testing against a panel of molecules. For example, 
specific pocket-binding phage include those that bind D-IQN17 but not D-GCN4- 
pI Q I (with the same three surface mutations as in IQN17) or a version of D-IQN17 
with a point mutation in the hydrophobic pocket, D-IQN17(G39W), in which 
glycine 39 is mutated to tryptophan, resulting in a large protrusion into the pocket. 

1 0 D-peptides identified in this manner can be assessed for their ability to inhibit HIV 
gp41, using known assays, such as the cell/cell fusion assay and HIV infectivity 
assay. The mirror-image phage display method described herein has demonstrated 
the value of IQN17 and IQN17(G39W), and their D-enantiomers in identifying 
inhibitors of HIV- 1 entry that bind the gp41 pocket. Of nine specific pocket-binding 

1 5 phage sequences identified (phage that bind to D-IQN1 7 but not to D- 

IQN1 7(G39W), eight contain a consensus EWXWL sequence and inhibit HIV-1 
gp41 -induced syncytia formation when tested as D-peptides. The ninth peptide was 
toxic to cells and was not investigated further. 

The D-versions of IQN17 and IQN17(G39W) can be used in a similar 

20 manner with other biologically encoded libraries, to discover other pocket-binding 
molecules that are not subject to enzymatic degradation by natural enzymes. For 
example, other phage-display libraries can be used to identify new D-peptide 
inhibitors (e.g., with a different number of residues between the flanking Cys 
residues and/or with randomly encoded amino acid residues outside the regions 

25 flanked by cysteine residues and/or with more than two cysteine residues). 

Strategies for encoding peptide libraries without phage (e.g., in which the encoding 
mRNA is attached to the peptide) can be used to identify D-peptide inhibitors. RNA 
or DNA libraries can be used (e.g., with SELEX methods) to identify L-ribose- or L- 
deoxyribose-based RNA or DNA aptamers, respectively, that bind to the 

30 hydrophobic pocket and are not substrates for natural nucleases (see e.g., Williams 
et a/., PNAS, 74: 1 1285 (1997)). 
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Although the versions of IQN17 and IQN17(G39W) of natural L-handedness 
can also be used in similar manner with biologically encoded libraries, the most 
likely applications will be with other, non-biological ly encoded libraries. For 
example, chemical combinatorial libraries on beads (of the one-bead, one-compound 
5 variety) can be screened with labeled IQN17 (e.g., radioactive or with a 

chromophore) to identify beads containing molecules that bind to IQN17. In this 
. example, IQN17(G39W) can be used as a counterscreen to determine if the 
molecules on the bead bind to the pocket of IQN17. (If they bind to IQN17(G39W), 
then they are not likely to be pocket-binding molecules.) As another example, beads 

10 to which IQN1 7 had been previously attached can be incubated with a mixture of 
potential pocket-binding molecules (e.g., a mixture of chemicals, or a natural 
product extract). IQN17 (bound to the beads) can then be separated from the 
mixture, washed, and then subjected to conditions (e.g., organic solvent, low pH, 
high temperature) that elute molecules bound to the IQN17 on the beads. The eluted 

1 5 molecules (i.e., potential pocket-binding molecules) could be identified by analytical 
chemistry methods (e.g., HPLC, mass spectrometry). A counterscreen with 
IQN17(G39W) is useful to help to identify true pocket-binding molecules. 

Drugs identified by the methods described above are then further tested for 
their ability to inhibit (totally or partially) HIV gp41 function (membrane fusion) 

20 and, thus entry into cells, using further in vitro assays, such as the syncytium assays 
and/or infectivity assays described herein or others known to those of skill in the art, 
and/or in vivo assays in appropriate animal models or in humans. 

One embodiment of the present invention is a method of identifying a drug 
that binds the N-helix coiled-coil of HIV gp41, particularly the N-helix coiled-coil 

25 pocket. The method comprises combining a candidate drug to be assessed for its 
ability to bind the N-helix coiled-coil pocket of HIV gp41 and peptide which 
comprises a soluble, trimeric coiled-coil and a sufficient portion of the N-peptide of 
HIV gp41 to include the HTV gp41 pocket, under conditions appropriate for 
presentation of the HIV gp41 pocket for binding by a molecule or compound (e.g., a 

30 drug) and determining whether the candidate drug binds the HIV gp41 pocket. If 
binding of the candidate drug with the HIV gp41 pocket occurs, the candidate drug 
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is a drug which binds the N-helix coiled-coil pocket of HIV gp41. Optionally, 
binding of the candidate drug can be assessed in the assay as described above, 
except that a peptide that binds the N-helix coiled-coil pocket (a peptide previously 
identified as one which binds the pocket) is combined with the candidate drug and 
5 the peptide. In this competitive assay, binding of the candidate drug to the N-helix 
coiled-coil pocket is assessed in the presence of a known binding moiety (a molecule 
or compound which binds the pocket). If binding of the candidate drug occurs in the 
presence of the known binding moiety, the candidate drug is a drug which binds the 
N-helix coiled-coil pocket with sufficient affinity to successfully compete with the 

10 known binding moiety. The fusion protein used in this embodiment comprises a 
soluble, trimeric version of a coiled-coil, such as a soluble, trimeric version of the 
coiled-coil region of a protein (e.g., a non-HIV protein, such as that of GCN4 or 
GCN4-pIqI, although an HIV protein can be used) and a sufficient portion of the N- 
peptide of HIV gp41 to include the HIV gp41 cavity. For example, this portion can 

15 comprise SEQ ID NO.: 20 or a sufficient portion to comprise the cavity and, when 
present in an appropriate fusion protein or other soluble model, present the cavity in 
such a manner that it is available for binding. Alternatively, a variant of the HIV 
gp41 sequence present herein, a sequence from another strain of the human virus 
(e.g., HIV-2) or a sequence from another species (e.g., SIV, feline 

20 immunodeficiency virus, Visna virus (M. Singh et al y J. Mol Biol, 290:1031 

(1999)) can be used in the fusion protein or soluble model. The fusion protein can 
comprise a soluble, trimeric version of the coiled-coil of any protein, provided that 
when it is in the fusion protein with the HIV component, the HIV cavity is presented 
in such a manner that it is available for binding. It can be, for example, that of 

25 GCN4-pIqI, GCN4-pH, Moloney Murine Leukemia Virus (Mo-MLV) or the ABC 
heterotrimer. In one embodiment, the fusion protein is IQN1 7 in the D- form. In 
another embodiment, the fusion protein is IQN17 in the natural L-handedness. 

In the competitive assay format, any peptide known to bind the N-helix 
coiled-coil cavity can be used as the known binding moiety. For example, any of the 

30 peptides described herein (SEQ ID NOS.: 3-12, 15, 17-19, 23, 24) or a variant or 
portion thereof can be used. Also, any non-peptide pocket-binding molecule can be 
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used in the competitive assay format. The competitive assay can be performed in 
solution, on a bead, or on a solid surface. 

In one embodiment, the candidate drug is detectably labeled and binding of 
the candidate drug to the HIV gp41 N-helix coiled-coil is determined by detecting 

5 the presence of the detectable label on the HIV gp41 N-helix coiled-coil (as a result 
of binding of the labeled candidate drug to the N-helix coiled-coil). Detection of the 
label on the helix coiled-coil pocket of the soluble model is indicative of binding of 
the candidate drug to the N-helix coiled-coil pocket and demonstrates that the 
candidate drug is a drug which binds the N-helix coiled-coil pocket. If the labeled 

10 candidate drug is detected on the fusion protein, the candidate drug is a drug which 
binds the N-helix coiled-coil cavity. 

In another embodiment of the method of identifying a drug that binds the N- 
helix coiled-coil pocket of the HIV gp41, a soluble model that presents the pocket in 
such a manner that it is available for binding by a drug is combined with a candidate 

1 5 drug and whether binding of the candidate drug with the N-helix coiled-coil of the 
soluble model occurs is determined. If binding occurs, the candidate drug is a drug 
which binds the pocket. Here, too, a competitive assay format can be used. The 
components of the competition assay (e.g., IQN1 7 and a D-peptide) can be labeled, 
with any of a variety of detectable labels, including fluorophore/quencher 

20 combinations. The candidate drug can be labeled, as described above, with any of a 
variety of detectable labels. The components of the soluble model (fusion protein) 
used in this embodiment and the competing moiety which is used in a competitive 
assay format can also be as described above. 

The present invention also relates to a method of producing a drug that binds 

25 the N-helix coiled-coil pocket of HIV gp41 . In one embodiment, the method is 

carried out as follows: A soluble model that presents the N-helix coiled-coil pocket 
of HIV gp41 or a fusion protein which comprises a soluble, trimeric coiled-coil (e.g., 
of a protein, such as a non-HIV protein, such as GCN4-pI Q I, GCN4-pII, Mo-MLV, 
ABC he'terotrimer or an HIV protein) is combined with a candidate drug to be 

30 assessed for its ability to bind the N-helix coiled-coil pocket of HIV gp41 and inhibit 
entry into cells, under conditions appropriate for presentation of the HIV gp41 
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pocket for binding by a drug. Whether the candidate drug binds the HIV gp41 
pocket is determined, wherein if binding of the candidate drug to the N-helix coiled- 
coil pocket of HIV gp41 occurs, the candidate drug is a drug which binds the N- 
helix coiled-coil cavity of HIV gp41. In this embodiment, the fusion protein 

5 comprises a soluble, trimeric coiled-coil (e.g., of a protein such as a non-HIV 
protein, such as a soluble, trimeric coiled coil of GCN4, GCN4-pIQI, GCN4-pII, 
Mo-MLV, ABC heterotrimer or an HIV protein) and a sufficient portion of the N- 
peptide of HIV gp41 to include the HIV gp41 N-helix coiled-coil pocket (e.g., all or 
a portion of SEQ ID NO.: 20, a variant or modification thereof or a sequence from 

10 another strain or species). IQN17, described herein, can be used in this method; the 
D enantiomer of IQN17 can also be used (e.g., in mirror-image phage applications). 
The ability of the drug produced to inhibit HIV entry into cells is assessed, for 
example, in a syncytium assay and/or an infectivity assay, as described herein. It 
can be further assessed in an appropriate animal model or in humans. 

1 5 The invention also relates to a method of producing a drug that binds the N- 

helix coiled-coil pocket of HIV gp41. The method comprises: producing or 
obtaining a soluble model of the N-helix coiled-coil pocket of HTV gp41 (e.g., a 
fusion protein as described herein and particularly IQN17 or a variant thereof); 
combining a candidate drug (a molecule or compound) to be assessed for it ability to 

20 bind the N-helix coiled-coil pocket of HIV gp41 and the soluble model of the N- 
helix coiled-coil pocket of HIV gp41 and determining whether the candidate drug 
binds the N-helix coiled-coil pocket of HIV gp41. If the candidate drug binds the N- 
helix coiled-coil pocket of HIV gp41, the candidate drug is a drug which binds the 
N-helix coiled-coil pocket of HIV gp41; as a result, a drug which binds the N-helix 

25 coiled-coil cavity of HIV gp41 is produced. The fusion protein used in this 

embodiment is described herein and can be, for example, IQN17, the D enantiomer 
of IQN17, or variants thereof. Alternatively, a drug that binds the N-helix coiled- 
coil pocket of HIV gp41 and inhibits entry of HIV into cells can be produced by a 
method comprising: producing or obtaining a soluble model of the N-helix coiled- 

30 coil pocket of HIV gp41 , as described herein; combining the soluble model and a 
candidate drug to be assessed for its ability to bind the N-helix coiled-coil pocket of 
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HIV gp41; determining whether the candidate drug binds the N-helix coiled-coil 
pocket of the soluble model (fusion protein), wherein if binding occurs, the 
candidate drug is a drug which binds the N-helix coiled-coil of HIV gp41; and 
assessing the ability of the drug which binds the N-helix coiled-coil to inhibit HTV 
5 entry into cells, wherein if the drug inhibits HIV entry into cells, it is a drug which 
binds the N-helix coiled-coil pocket of HIV gp41 and inhibits HTV entry into cells. 
Its ability to inhibit HIV entry into cells can be assessed in vitro (e.g., in a syncytium 
assay, an infectivity assay) or in vivo (e.g. in an appropriate animal model or in 
humans). The soluble model can be a peptide which comprises a soluble, trimeric 
1 0 coiled-coil, such as that of a protein (e.g., GCN4-pIqI) and a sufficient portion of the 
N-peptide of HIV gp41 to include the HIV gp41 pocket. 

Drugs identified or produced by the methods described herein, as well as by 
other methods, which bind the N-helix coiled-coil pocket of HIV gp41 and inhibit 
HIV entry into cells are also the subject of this invention. 
1 5 Drugs identified or produced by the methods describee! herein, as well as by 

other methods, which bind to more than one N-helix coiled-coil pocket of HIV gp41 
and inhibit HIV entry into cells are also the subject of this invention. Such drugs 
can be obtained, for example, by linking two or more pocket-binding molecules 
(drugs) via an appropriate linker (e.g., a linker of amino aicd residues or other 
20 chemical moieties) to increase the effectiveness of inhibition. The pocket-binding 
molecules that are linked can be the same or different. Drugs identified or produced 
by the methods described herein or by other methods which bind to the N-helix 
coiled-coil pocket of HIV gp41, in addition to binding to HIV gpl20, CD4, CCR5, 
CXCR4, or a non-pocket region of HIV gp41 are also the subject of this invention. 
25 Drugs which inhibit HIV gp41 can also be designed or improved with 

reference to the X-ray crystal structure of the complex between IQN17 and a D- 
peptide which binds the N-helix coiled-coil cavity presented by IQN17, such as with 
reference to the X-ray structure of the complex between IQN17 and DlOpepl, 
presented herein. Alternatively, or in addition, drugs which inhibit HIV gp41 can 
30 also be designed or improved with reference to the X-ray crystal structure of free 
IQN17, presented herein. 
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Compounds and molecules (drugs) identified as described herein inhibit 
(partially or totally) entry of HIV into cells, and thus are useful therapeutically in 
uninfected individuals (humans) and infected individuals (e.g., to prevent or reduce 
infection in an uninfected individual, to reduce or prevent further infection in an 

5 infected individual) and as research reagents both to study the mechanism of gp41- 
induced membrane fusion and to assess the rate of viral clearance by an individual 
and as reagents to discover or develop other compounds and molecules (drugs) that 
inhibit entry of HIV into cells. D-peptides described herein (e.g., D10pep5, 
DlOpepl) have been shown, using the infectivity assay described herein, to inhibit 

10 infection of cells. Other D-peptides can be similarly assessed for their ability to 
inhibit infectivity. 

The drugs can be administered by a variety of route(s), such as orally, 
nasally, intraperitoneally, intramuscularly, vaginally or rectally. In each 
embodiment, the drug is provided in an appropriate carrier or pharmaceutical 

1 5 composition. For example, a cavity-binding drug can be administered in an 

s 

appropriate buffer, saline, water, gel, foam, cream or other appropriate carrier. A 
pharmaceutical composition comprising the drug and, generally, an appropriate 
carrier and optional components, such as stabilizers, absorption or uptake enhancers, 
flavorings and/or emulsifying agents, can be formulated and administered in 

20 therapeutically effective dose(s) to an individual (uninfected or infected with HIV). 
In one embodiment, drugs which bind the N-helix coiled-coil of gp41 (e.g., those 
described herein, DP178 (C. T. Wild, D. C. Shugars, T. K. Greenwell, C. B. 
McDanal, T. J. Matthews, ibid, 97:9770 (1994)), T649 which corresponds to 
residues 1 17-152 of HIV-1 gp41 (HXB2 strain) and is acetylated at the amino 

25 terminus and amidated at the carboxy terminus) (L. T. Rimsky, D. C. Shugars, T. J. 
Matthews, J. Virol, 72:986 (1998), are administered (or applied) as microbicidal 
agents and interfere with viral entry into cells. For example, a drug or drugs which 
bind(s) the HIV cavity can be included in a composition which is applied to or 
contacted with a mucosal surface, such as the vaginal, rectal or oral mucosa. The 

30 composition comprises, in addition to the drug, a carrier or base (e.g., a cream, foam, 
gel, other substance sufficiently viscous to retain the drug, water, buffer) appropriate 
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for application to a mucosal surface or to the surface of a contraceptive device (e.g., 
condom, cervical cap, diaphragm). The drug can be applied to a mucosal surface, 
such as by application of a foam, gel, cream, water or other carrier containing the 
drug. Alternatively, it can be applied by means of a vaginal or rectal suppository 
5 which is a carrier or base which contains the drug or drugs and is made of a material 
which releases or delivers the drug (e.g., by degradation, dissolution, other means of 
release) under the conditions of use (e.g., vaginal or rectal temperature, pH, moisture 
conditions). Such compositions can also be administered orally (e.g., swallowed in 
capsule, pill, liquid or other form) and pass into an individual's blood stream. In all 

10 embodiments, controlled or time release (gradual release, release at a particular time 
after administration or insertion) of the drug can be effected by, for example, 
incorporating the drug into a composition which releases the drug gradually or after 
a defined period of time. Alternatively, the drug can be incorporated into a 
composition which releases the drug immediately or soon after its administration or 

1 5 application (e.g., into the vagina, mouth or rectum). Combined release (e.g., release 
of some of the drug immediately or soon after insertion, and over time or at a 
particular time after insertion) can also be effective (e.g., by producing a 
composition which is comprised of two or more materials: one from which release 
or delivery occurs immediately or soon after insertion and/or one from which release 

20 or delivery is gradual and/or one from which release occurs after a specified period). 
For example, a drug or drugs which bind the HIV cavity can be incorporated into a 
sustained release composition such as that taught in U.S. Patent 4,707,362. The 
cream, foam, gel or suppository can be one also used for birth control purposes (e.g., 
containing a spermicide or other contraceptive agent), although that is not necessary 

25 (e.g., it can be used solely to deliver the anti-HIV drug, alone or in combination with 
another non- contraceptive agent, such as an antibacterial or antifungal drug or a 
lubricating agent). An anti-HIV drug of the present invention can also be 
administered to an individual through the use of a contraceptive device (e.g.,- 
condom, cervical cap, diaphragm) which is coated with or has incorporated therein 

30 in a manner which permits release under conditions of use a drug or drugs which 
bind the HTV gp41 N-helix coiled coil. Release of the drug(s) can occur 
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immediately, gradually or at a specified time, as described above. As a result, they 
make contact with and bind HFV and reduce or prevent viral entry into cells. 

In another embodiment, a drug which interferes with HIV entry into cells by 
a mechanism other than binding to the gp41 N-helix coiled-coil cavity (e.g., a drug 

5 which interferes with viral entry by interfering with gp 1 20 binding at the CD4 stage) 
is administered or applied to a mucosal surface as described above for drugs which 
bind to the gp41 N-helix coiled coil. 

Fusion proteins of the present invention comprise a soluble, trimeric form or 
version of a coiled-coil, such as a soluble, trimeric form or version of a coiled-coil 

1 0 region of a protein (of non-HIV origin or of HIV origin) and a sufficient portion of 
the C-terminal end of the N peptide of HIV gp41 to include (comprise) the HIV 
coiled-coil cavity or hydrophobic pocket (the pocket-comprising residues of the N- 
peptide). The N peptide of HIV gp41 can be that of HIV-1, HIV-2, another HTV 
strain or a strain from another species (e.g., simian immunodeficiency virus (SIV), 

1 5 feline immunodeficiency virus or Visna virus). For example, HIV-2 sequence 
LLRLTVWGTKNLQARVT (SEQ ID NO: 26), SIV sequence 
LLRLTVWGTKNLQTRVT (SEQ ID NO: 27) or a sequence comprising invariant 
residues in HIV-1, HIV-2 and SIV (represented LLXLTVWGXKXLQXRXX (SEQ 
ID NO: 42), wherein amino acid residues L, T, V, W, G, K, Q, and R are the single 

20 letter code used for amino acid residues and X can be any amino acid residue). Also 
the subject of this invention is a soluble trimeric model of the HIV gp41 
hydrophobic pocket, which can be a D-peptide or an L-peptide and comprises a 
soluble trimeric coiled coil and a sufficient portion of the N peptide region of HIV 
gp41 to comprise the amino aicd residues which form the pocket of the N-helix 

25 coiled-coil region of HIV gp41 . The D- or L-peptide can comprise as the soluble, 
trimeric coiled coil the coiled coil of GCN4-pI Q I, of GCN4-pII, of Moloney Murine 
Leukemia Virus or of the ABC heterotrimer. The component which is a sufficient 
portion of the N peptide of HIV gp41 to comprise the amino acid residues of the 
pocket can comprise, for example: LLQLTVWGIKQLQARIL of HIV-1 (SEQ ID 

30 NO: 20); LLRLTVWGTKNLQARVT of HIV-2 (SEQ ID NO: 26); 

LLRLTVWGTKNLQTRVT of SIV (SEQ ID NO: 27) or the invariant residues of 



WO 00/06599 



PCT/US99/17351 



-41- 

these, which are: LLXLTVWGXKXLQXRXX (SEQ ID NO: 42). 

One embodiment of the instant invention are fusion proteins between a 
trimeric version of the coiled-coil region of a protein (such as GCN4-pIqI) and the 
N-helix coiled-coil of HIV gp41 that include all, part or none of the N-helix cavity. 
5 That is, a fusion protein of the present invention can comprise a trimeric form of the 
coiled-coil region of GCN4-pI Q I and a portion of the N-peptide of HIV- 1 gp41, 
wherein the portion of the N-peptide of gp41 comprises part, or all, or none of the 
N-helix cavity of HIV- 1 gp41 . For example, a fusion protein can be made that 
contains residues from GCN4-pIqI and residues from N36. The fusion protein, 

10 denoted IQN24n, contains 29 residues of GCN4-pIqI, including three mutations for 
increased solubility, and 24 residues from the N-terminal end of N36 
(SGIVQQQNNLLRAIEAQQHLLQLT) (SEQ ID NO 21); for recombinant 
expression in E. coli, an extra Met residue is included at the N-terminus. For 
example, a fusion protein can comprise a portion of the N-peptide of HIV gp41 

15 comprising the amino acid sequence of (SEQ ID.: 21). The sequence of IQN24n is: 
MRMKQffiDKIEEffiSKQKKIENEIARIKm 

T (SEQ ID.: 22). This fusion protein can be made by a variety of methods, 
including chemical synthesis or recombinant DNA methods or by recombinant 
expression in E. coli, in which case the N- and C-termini are not blocked. Because 

20 the superhelix parameters of the GCN4-pIqI coiled coil are nearly identical to the 
HIV gp41 N-helix coiled coil, the resulting fusion protein molecule (IQN24n) is 
predicted to form a long trimeric coiled coil, which presents part of the gp41 N-helix 
coiled coil as a trimer (not aggregated). 

An alternative embodiment of the instant invention provides a method of 

25 eliciting an immune response in an individual. The strategy used to create a soluble, 
trimeric model for part of the gp41 N-terminal region coiled coil is also helpful to 
develop HIV vaccine candidates. One goal for a potential HIV vaccine is to elicit a 
neutralizing antibody response that binds to the "pre-hairpin" intermediate of the 
HIV-1 gpl20/gp41 envelope protein complex. In this transient form, the N-helix 

30 region of gp41 is exposed, but the C-helix region is not. Although it seems 

reasonable to use an N-peptide (such as N36, N5 1 or DP-107) as an immunogen to 
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elicit an antibody response against the N-helix region of gp41 t the isolated N- 
peptides are aggregated and do not properly present the gp41 N-helix coiled-coil 
trimer. Accordingly, the same strategy described herein to solve this problem for the 
gp41 hydrophobic pocket can be applied towards the development of soluble, 

5 trimeric models of the gp41 N-helix coiled-coil region, in general. Such trimeric 
models (including IQN 17, but also including, for example, peptides that do not 
contain the pocket residues of gp41) can be used as immunogens to elicit an 
antibody response to the pre-hairpin intermediate, thereby inhibiting HIV-1 
infection. For example, an individual to be immunized can be administered a fusion 

1 0 protein comprising a trimeric form of a coiled-coil region of a protein and a portion 
of an N-peptide from HIV-1 gp41, wherein the portion from gp41 comprises part of, 
all of, or none of the N-helix coiled-coil cavity in a pharmaceutical^ acceptable 
carrier. For example, IQN24n can be used, either alone or in combination with other 
materials, in a vaccine, which will elicit the production of antibodies that bind to the 

1 5 coiled coil in the individual to whom it is administered (the vaccinee), and thereby 
offer protection against infection and/or disease. IQN24n can also be used to 
identify (from humans, other animals or antibody libraries) and/or raise antibodies 
(monoclonal and/or polyclonal) that bind to the N-helix coiled coil. This provides 
the basis for a diagnostic method in which IQN24n (or IQN 17 or other soluble 

20 trimeric model) is used to assess the presence/absence/level of antibodies that bind 
the N-helix coiled coil in a biological sample (e.g., blood). 

r 

Any of a wide variety of variations can be made in the GCN4-pIqI 
component of fusion proteins described herein (e.g., IQN17 or IQN24n) and used in 
the method, provided that these changes do not alter the trimeric state of the coiled- 

25 coil. Changes can also be made in the amino acid composition of the fusion protein 
component which is the portion from the HIV gp41 N36 peptide, to produce variants 
(e.g., variants of IQN17 or IQN24n). There is no limit to the number or types of 
amino acid residue changes possible, provided that the trimeric state of the coiled- 
coil and the structure of the surface of the fusion protein corresponding to the N- 

30 peptide coiled coil of HIV gp41 are maintained. The fusion protein component 
which is the portion of the HIV gp41 N-peptide can include all, part, or none of the 
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N-helix cavity. For example, other parts of N51, N36, DP-107, or other regions of 
the HIV gp41 N-helix region can be fiised to GCN4-pI Q I (or another trimeric version 
of the coiled-coil region of a protein) to generate trimeric (not aggregated) helical 
coiled-coil fusion proteins and used in the method. There is no limit to the number 

5 or types of fusion proteins that can be designed and generated, provided that the 
trimeric state of the coiled-coil and the structure of the surface of the fusion protein 
corresponding to the N-peptide coiled coil of HIV gp41 are maintained. Such fusion 
proteins can be designed and generated using methods known to those of skill in the 
art, such as evaluating heptad-repeat positions or superhelix parameters of coiled 

10 coils. 

Described herein are peptides, which can be D-peptides or L-peptides, 
which bind to a cavity on the surface of the N-helix coiled-coil of HIV envelope 
glycoprotein gp41 (e.g., HIV-1, HIV-2). Such peptides can be of any length, 
provided that they are of sufficient length to bind the cavity in such a manner that 

1 5 they interfere with the interaction of the N-helix coiled-coil cavity and amino acid 
residues of the C-peptide region of HIV gp41 and prevent HIV entry into the cells. 
For example, D- or L-peptides comprise at least two amino acid residues and 
generally will be from about two to about 21 amino acid residues. That is, they can 
comprise any number of amino acid residues from about two to about 21. The 

20 amino acid residues can be naturally occurring or non-naturally occurring or 
modified, as described below. The peptides can be linear or circular. 

Examples of D-peptides, identified as described herein, are shown in Figure 
3. Because of library design, each peptide, in addition to the amino acid residues 
shown, is flanked by GA on the N-terminus and AA on the C-terminus. N-terminal 

25 lysine residues were added to improve water solubility. 

In one embodiment, the present invention provides compounds which inhibit 
the binding of the N-helix coiled coil to the C-helix of HIV-1 gp41 envelope protein. 
Such compounds are of use in a method of treating a patient infected by, or 
potentially subject to infection by, HIV. These compounds are also of use in a 

30 method of assessing the ability of a second compound to bind to the N-helix coiled 
coil cavity. 
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10 



In one embodiment, the compounds which inhibit the binding of the N-helix 
coiled coil to the C-helix of HIV-1 gp41 envelope protein are of Formula I, 



wherein A, B, D and E are each, independently, a D-amino acid residue, an L-amino 
acid residue, or an N-substituted glycyl residue. Natural or nonnatural amino acid 
residues can be used. K, L, M and N are each, independently, an amino acid residue 
or a polypeptide group of from 2 to about 6 amino acid residues which can be the 
same or different, and n, p, q and r are each, independently, 0 or 1 . F is a direct bond 
or a difunctional linking group and s is 0 or 1. 

In one subset of the compounds of Formula I, A is a D- amino acid residue, 
an L-amino acid residue or an N-substituted glycyl residue of the formula 



where one of R A1 and is a substituted or unsubstituted aryl, heteroaryl, 
arylmethyl, heteroarylmethyl, benzo-fused aryl, benzo-fused heteroaryl, benzo-fused 
arylmethyl, benzo-fused heteroarylmethyl, cycloalkyl or bicycloalkyl; and the other 
is hydrogen. W is hydrogen, methyl, trifluoromethyl or halogen, for example, 
fluorine, chlorine, bromine or iodine. 

B is a glycyl residue or D-amino acid or N-substituted glycyl residue of the 
formula 



(N >-(Mij- A— B— D E— 7 (K^- (I-*, 




(I), 



I I II 

N C C 
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Rbi X ° 



I 

■N C C 



RB2 

where one of R BI and R B2 is a substituted or unsubstituted linear, branched or cyclic 
alkyl, aryl, arylalkyl, heteroaryl or heteroarylalkyl group; and the other is hydrogen. 
X is hydrogen, methyl, trifluoromethyl or halogen, such as fluorine, chlorine, 
bromine or iodine. 

D is a D- amino acid residue or N-substituted glycyl residue of the formula 

i m I S 

N C C 



R02 

where one of R D1 and R D2 is a substituted or unsubstituted aryl, heteroaryl, 
arylmethyl, heteroarylmethyl, benzo-fused aryl, benzo-fused heteroaryl, benzo- 
1 0 fused arylmethyl; benzo-fused heteroarylmethyl, cycloalkyl or bicycloalkyl; and the 
other is hydrogen. Y is hydrogen, methyl, trifluoromethyl or halogen, such as 
fluorine, chlorine, bromine or iodine. 

E is a D-amino acid residue or N-substituted glycyl residue of the formula 

Rei Z ° 

I I 
N C— 



RE2 

1 5 where one of R El and R E2 is a substituted or unsubstituted, linear, branched or cyclic 
alkyl, aryl or arylalkyl group; and the other is hydrogen. Z is hydrogen, methyl, 
trifluoromethyl or halogen, such as fluorine, chlorine, bromine or iodine. 

K, L, M and N are each, independently, composed of from 1 to about 6 
(which can be the same or different), D-amino acid residues, L-amino acid residues, 

20 N-substituted glycyl residues or a combination thereof. Natural or nonnatural amino 
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acid residues can be used. One or more of the amino acid residues or N-substiuted 
glylcyl residues can, optionally, be substituted at the cc-carbon by a methyl or 
trifluoromethyl group, or a halogen, such as a fluorine, chlorine, bromine or iodine 
atom. 

5 In a preferred embodiment, one of R A1 and and one of R D1 and R D2 are, 

independently, a phenyl, substituted phenyl, naphthyl, substituted naphthyl, 
naphthylmethyl, substituted naphthylmethyl, benzyl or substituted benzyl group, or 
a group of the formula 




10 where J is O, S or NR, where R is H or linear, branched or cyclic C r C 6 -alkyl, 

preferably methyl. R„ R 2 , R 3 , R 4 and R 5 are independently selected from the group 
consisting of hydrogen, halogen and alkyl, preferably, linear, branched or cyclic C,- 
C 4 -alkyl, such as methyl. Suitable phenyl, naphthyl, naphthylmethyl and benzyl 
substituents include alkyl, preferably linear, branched or cyclic C,-C 4 -alkyl, such as 

15 methyl; and halogen, such as flourine, chlorine, bromine or iodine. More preferably, 
R A! and R DI are both hydrogen, and R^ and R D2 are each, ndependently, one of the 
foregoing groups. 

Preferably, one of R BI and R B2 is hydrogen, substituted or unsubstituted 
linear, branched or cyclic C r C 4 -alkyl, phenyl, benzyl, naphthyl or naphthylmethyl. 
20 Suitable substituents include linear, branched or cyclic C,-C 4 -alkyl groups and 
halogens, such as fluorine, chlorine, bromine or iodine. More preferably, R BI is 
hydrogen and Rg 2 is one of the foregoing groups. 

Preferably, one of R EI and R K is a substituted or unsubstituted, linear, 
branched or cyclic C,-C 6 -alkyl group or a substituted or unsubstituted phenyl or 
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naphthyl group. Suitable substituents include linear, branched or cyclic C,-C 4 -alkyl 
groups, such as methyl, and halogens, such as fluorine, chlorine, bromine and 
iodine. More preferably, R E j is hydrogen and is one of the foregoing groups. 
In a preferred subset of the compounds of formula I, A and D are each a D- 

5 tryptophan residue and E is a D-leucine residue. 

Preferably, K is a D-amino acid residue or an N-substituted glycyl residue 
comprising an amino-, carboxyl- or sulfhydryl substituted side chain, such as a 
cysteine, glutamic acid, aspartic acid or lysine residue, and L is a polypeptide 
comprising 2 or 3 D-amino acid residues, L-amino acid residues (the D- or L-amino 

1 0 acid residues can be the same or different) or N-substituted glycine residues. For 
example, in one embodiment, L comprises 2 or 3 residues selected from among D- 
glycine, D-alanine or D-<x-C r C 4 -alkylglycine. 

Preferably, M is a polypeptide group comprising from 2 to about 8 D-amino 
acid residues, of which at least one comprises an amino-, carboxy- or sulfhydryl 

15 substituted side chain, such as a cysteine, glutamic acid, aspartic acid or lysine 
residue. N is, preferably, a polypeptide group comprising from 1 to about 6 amino 
acid residues, of which at least one is a lysine residue. 

The identity of divalent linking group F is not critical, as long as it is of a 
suitable length to position residues A to E to interact with the N-helix coiled coil 

20 cavity (J.R. Morphy, Curr. Op. DrugDiscov. Develop., 7:59-65 (1998)). For 
example, F preferably has a length from about 2 to about 40 atoms. In one 
embodiment, F is a direct bond or a polypeptide linking group of the formula -P n -, 
wherein n is 1 to about 12 and each P is independently an L- or D- amino acid or N- 
substituted glycyl resdiue residue, a glycyl residue or an N-substituted glycyl 

25 derivative. 

In another embodiment, F is a substituted or unsubstituted C^C^-alkylene 
group, such as a polymethylene group of the formula -(CH 2 ) m -, wherein m is from 
about 4 to about 40; an alkylene group which is interrupted at one or more points by 
a heteroatom, such as a nitrogen, oxygen or sulfur atom. For example, F can be a 
30 group (CH 2 CH 2 0) q -, wherein q is from 1 to about 20. F can also be an alkylene 
group which is interrupted at one or more points by a phenylene or heteroarylene 
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group, or a polysaccharide group, for example, a glycoside or poly(glycoside) group 
comprising one or more glycoside groups, for example, from 1 to about 10 glycoside 
groups. Suitable glycosides include glucoside, lactoside, mannoside, galactoside, 
fucoside, fructoside, guloside, alloside, altroside, taloside, idoside and others, such 

5 as pyranosides and furanosides, which are known in the art. 

In compounds of Formula I having a C-terminal amino acid residue, the C- 
terminal residue can be, for example, in the form of an amide, an N-substituted 
amide or a carboxylic acid protecting group, as is known in the art. The nitrogen 
atom of an N-terminal residue can be acylated, for example, acetylated, or 

1 0 substituted with an amino protecting group, as is known in the art. 

The term "D-amino acid residue", as used herein, refers to an a-amino acid 
residue having the same absolute configuration as D-glyceraldehyde. When the 
amino acid residue includes a first non-hydrogen a substituent and a second a 
substituent selected from methyl and halogen, the absolute configuration is the same 

1 5 as that of D-glyceraldehyde with the second a substituent taking the place of the 
hydrogen atom at the glyceraldehyde ot-carbon. 

The peptides, portions of the peptides, variations/derivatives of the peptides 
or portions of the variations/derivatives described herein can be used as inhibitors of 
HIV entry into cells. The peptides represented in Figure 3 or a portion of a peptide 

20 sufficient to fit into the hydrophobic pocket at the C-terminal end of the coiled-coil 
and prevent interaction of the C-peptide region with the N-peptide region of gp41 
are useful to inhibit HIV infection. A portion of any of the peptides represented or 
of a derivative thereof can be from 2 to 20 (any number of residues from 2 to 20) 
amino acid residues in size. D-peptides which comprise the consensus sequence 

25 tryptophan-tryptophan-leucine or the sequence tryptophan-tryptophan-leucine- 

glutamate, described herein, and additional residues, can be used; the other residues 
present in such D-peptides and the size of the D-peptides can be selected with 
reference to peptides described herein or can be designed independent of those 
peptides, provided that these three or four residues are positioned in such a manner 

30 that the peptide can fit into the hydrophobic pocket and act as an inhibitor. 

Additional amino acid residues can also be present at the N-terminus, the C-terminus 
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or both of the D-peptides described herein, thus producing a larger peptide. 
Alternatively, there can be other amino acid residues selected, for example, to 
enhance binding affinity. Alternatively, a peptide which comprises the conserved 
amino acid residues of the D-peptides of Figure 3 can be used. For example, such a 

5 peptide can be 16 amino acid residues in size and include the conserved amino acid 
residues, which can be at the same positions as those at which they occur in the 
peptides shown in Figure 3. The intervening amino acid residues can be different 
from the amino acid residues at these positions in any of the peptides shown in 
Figure 3 (e.g., can be isoleucine or asparagine or other amino acid residue which 

1 0 does not appear in the peptides represented in Figure 3) or can be substituted for or 
replaced by an amino acid residue represented at a specific position in another 
peptide shown in Figure 3 (e.g., the aspartic acid residue in DlOpepl can be replaced 
by a serine residue). Amino acid residues other than the D-versions of the 20 L- 
amino acids found in natural proteins can be used. Such changes can be made, for 

1 5 example, to enhance bioavailability, binding affinity or other characteristic of the 
peptide. A D-peptide can comprise the conserved amino acid residues present in the 
peptides shown in Figure 3, but they can be separated by fewer (or more) amino acid 
residues than the number of intervening amino acid residues shown in Figure 3. For 
example, fewer than five amino acid residues (e.g., Tarrago-Litvak, L. et al, 

20 FASEB, X, 5:497 (1994); Tucker, TJ. et a/., Methods Enzymol, 275:440 (1996), 
Tarrago-Litvak, L. et al., FASEB, 1, 5:497 (1994); Tucker, TJ. et al., Methods 
EnzymoL, 275:440 (1996)), can be present between the first cysteine and the 
glutamic acid in the consensus sequence shown in Figure 3. Alternatively, these two 
residues can be separated by more than five amino acid residues. Internal 

25 modifications can also be made (e.g., to enhance binding or increase solubility of a 
peptide). For example, the first tryptophan of DIOpepS can be replaced by an 
arginine to increase solubility. A D-peptide can have additional moieties or amino 
acids at its N-terminus. For example, a moiety which blocks the N terminus or gets 
rid of the charge otherwise present at the N-terminus can be added. The moiety can 

30 be, for example, a blocking moiety, such as an acetyl group linked directly to the 
glycine (G), or an acetyl group linked to one or more additional amino acid residues 
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linked to the N-terminal of G, such as an acetyl group linked to one or more lysine 
residues, which, in turn, are linked to the N terminal G. In one embodiment, two 

lysine residues are linked to the N-terminal G (KKGAC ), for example to 

increase the solubility of the peptide; a blocking moiety, such as an acetyl group, can 

5 be linked to the terminal lysine (acetyl group KKGAC ). In another 

embodiment, four lysine residues are linked to the N-terminal G. In addition, a D- 
peptide can have additional and/or altered moieties or amino acids at its C-terminus. 
For example, one or both of the alanine residues at the C-terminus can be altered 
and/or one or more residues can be added at the C-terminus, for example to enhance 

10 binding. Alternatively, functional (chemical) groups other than amino acid residues 
can be included to produce an inhibitor of the present invention. For example, these 
additional chemical groups can be present at the N-terminus, the C-terminus, both 
termini or internally. In addition, two or more D-peptides can be linked via an 
appropriate linker (e.g., a linker of amino acid residues or other chemical moieties) 

1 5 to increase the effectiveness of inhibition. Alternatively, one or more D-peptides 
can be linked via an appropriate linker to a molecule (drug) that binds to HIV gpl20, 
CD4, CCR5, CXCR4, or a non-pocket region of HIV gp41 to increase the 
effectiveness of inhibition. 

The D-peptides (or L-peptides or peptides with both D- and L-amino acids) 

20 can be produced using known methods, such as chemical methods or recombinant 
technology. The polypeptide backbone can be altered (e.g., N-methylation) or 
replaced with alternative scaffolds (e.g., peptoids) at one or more positions of the 
peptides. Additional components can be included in the peptides, such as, for 
example, linkers (chemical, amino acid) which are positioned between amino acids 

25 or amino acid portions of the peptide (e.g., to provide greater flexibility or to provide 
greater rigidity). As described herein, the D-peptides of the present invention are 
flanked by GA at the N-terminus and AA at the C-terminus, due to the design of the 
library used in identifying the D-peptides. Some or all of these four amino acid 
residues may be altered, replaced or deleted in order to produce D-peptides with, for 

30 example, altered absorption, distribution, metabolism and/or excretion. In one 
embodiment, the C-terminus is modified by the addition of a glycine residue 
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immed iately before the C-terminal amide. In another embodiment, the most C- 
terminal A is altered/modified or replaced by a different amino acid residue or 
deleted. 

D-peptides, which are of the opposite handedness from the handedness of 
5 naturally-occurring peptides, do not serve as efficient substrates for enzymes, such 
as proteases, and, therefore, are not as readily degraded as L-peptides. In addition, 
there is no effective immune response which targets D-peptides and therefore, they 
do not elicit an immune response comparable to that elicited by L amino acid 
peptides. 

1 0 The present invention is illustrated by the following examples, which are not 

intended to be limiting in any way. 

Example 1 Synthesis of Variants of the C34 Peptide 

Mutant peptides were synthesized by solid-phase FMOC peptide chemistry 
and have an acetylated amino terminus and an amidated carboxy terminus. After 

15 cleavage from the resin, peptides were desalted with a Sephadex G-25 column 
(Pharmacia), and then purified by reverse-phase high-performance liquid 
chromatography (Waters, Inc.) on a Vydac CI 8 preparative column using a linear 
water-acetonitrile gradient and 0.1% trifluoroacetic acid. Peptide identities were 
verified by MALDI mass spectrometry (Voyager Elite, PerSeptive Biosystems). 

20 Peptide concentrations were measured by tryptophan and tyrosine absorbance in 6 M 
GuHCl [H. Edelhoch, Biochemistry, (5:1948 (1967)]. 



Example 2 Quantitation of Helical Content and Thermal Stability of Mutant 
N36/C34 Complexes 
CD measurements were performed in phosphate-buffered saline (50 mM 
25 sodium phosphate, 150 mM NaCl, pH 7.0) with an Aviv Model 62DS spectrometer 
as previously described (M. Lu, S. C. Blacklow, P. S. Kim, Nat. Struct. Biol, 2:1075 
(1995)). The apparent melting temperature of each complex was estimated from the 
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maximum of the first derivative of [6] 222 with respect to temperature. 
The mean residue ellipticities ([e] 222 , 10 3 deg cm 2 dmol* 1 ) at 0°C were as follows: 
wildtype, -31.7; Met 629 - Ala; -32.0; Arg 633 -Ala, -30.7; He 635 - Ala, -25.9; Trp 628 -Ala, 
-27.0; Trp 63! - Ala, -24.9. In the case of the Trp 628 - Ala and Tip 631 - Ala mutations, 
5 the decrease in [0] 222 is likely to overestimate the actual reduction in helical content. 
The removal of tryptophan residues from model helices has been reported to 
significantly reduce the absolute value of [Q] m even when there is little change in 
helical content (A. Chakrabartty, T. Kortemme, S. Padmanabhan, R. L. Baldwin, 
Biochemistry, 32:5560 (1993)). 

1 0 Example 3 Identification of Peptides Which Bind to a Pocket on the Surface of 
the N-helix Coiled-Coil of fflV-1 gp41. 

Methods are available to identify D-peptides which bind to a cavity on the 
surface of the N-helix coiled-coil of HIV envelope glycoprotein gp41. As described 
in detail below, D-peptides which bind to a cavity on the surface of the N-helix 

1 5 coiled-coil of HIV- 1 envelope glycoprotein gp4 1 were identified by mirror-image 
phage display. This method involves the identification of ligands composed of D- 
amino acids by screening a phage display library. D-amino acid containing ligands 
have a chiral specificity for substrates and inhibitors that is the opposite of that of 
the naturally occurring L-amino ligands. The phage display library has been used to 

20 identify D-amino acid peptide ligands which bind a target or desired L-amino acid 
peptide (Schumacher et al Science, 277.1854-1857 (1996)). 

D-peptides that bind to the hydrophobic pocket of gp41 were identified using 
a target that is an enantiomer of IQN17, a hybrid molecule containing 29 residues of 
GCN4-pI Q I on the N-terminal end and 1 7 residues of gp41 on the C-terminus. The 

25 phage library used for selection is described in U.S. Patent 5,780,221 and 

Schumacher et al Science, 277:1854-1857 (1996). The complexity of the library is 
greater than 10 8 different sequences. The sequences are flanked on either end by 
either a cysteine or a serine, with ten random residues in the middle. These 
sequences are located in the pill gene of the phage, a coat protein that is expressed 

30 as approximately five copies on the outer surface of the phage. 
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The following experimental procedures were used in the examples described 

herein. 

Phage Display 

Neutravidin (Pierce, 10 fig in 100 of 100 mM NaHC0 3 ) was added to 
5 individual wells of a 96-well high-binding styrene plate (Costar) and incubated 
overnight on a rocking platform at 4°C. The neutravidin was removed and the wells 
were washed four times with a TBS/Tween solution. Biotinylated D-IQN17 (100 
nL of a 10 nL peptide solution in lOOmM NaHC0 3 ) was added to the wells and 
incubated for one hour at 25°C. The biotinylated target was removed and a blocking 

10 solution (30 mg/ml nonfat dried milk in 100 mM NaHC0 3 ) was added to the wells 
and incubated for two hours, with rocking, at 4°C. The blocking solution was 
removed and the wells were coated again with the biotinylated target as above. The 
target was removed and the unliganded neutravidin was blocked by the addition of 
the blocking solution with 5 mM biotin. After removing the biotin, the wells were 

15 washed six times with the TBS/Tween solution. The phage stock was then added to 
the wells (50 jiL of phage stock plus 50 fiL of phage-binding buffer: TBS, 0.1% 
Tween-20, 1 mg/ml milk, 0.05% sodium azide). The incubation time of the phage 
stock in the wells decreased in increasing rounds of selection. After incubation, the 
phage solution was removed and the wells were washed twelve times with 

20 TBS/Tween to remove the unbound phage. Odd numbered washes were performed 
quickly, with no incubation time; even numbered washes were incubated for 
increasing amounts of time each round of phage selection. The phage were eluted 
by the addition of two micrograms of trypsin in 1 00 \iL of phage-binding buffer and 
2.5 mM CaCl 2 with an hour incubation at 37°C. To determine recovery, a dilution 

25 of the eluted phage was used to infect K 91 kan cells. After a one hour incubation, 
100 nL of cells were removed and 1:10, 1:100, and 1:100 dilutions in LB were 
plated on LB/tetracycline plates. Phage recovery was determined as a ratio of 
transducing units recovered (the titer of the eluted phage) to the input number of 
transducing units (the titer of the phage stock used that round). Transducing units 

30 were determined by counting the number of tetracycline-resistant colonies on the 
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LB/tetracycline plates. Non-specific phage recovery generally has a ratio in the 
order of magnitude of 10 8 to 10 9 , whereas specifically amplified phage have a ratio 
10' 7 or greater. Individual clones were amplified and sequenced. They were assayed 
in the binding assay to determine binding specificity. 
5 D 1 0pep7 was identified after five rounds of phage selection. D 1 Opep 1 , 

D10pep3, D10pep4, D10pep5, and D10pep6 were identified after seven rounds of 
phage selection. The phage selection was performed again, with shorter incubation 
times and longer washes, and DlOpeplO and D10pepl2 were identified after three 
rounds of selection. (A ninth D-peptide was identified but was not further 

1 0 investigated once it was shown to be toxic to cells.) 

To test the specificity of binding of identified phage clones to the pocket of 
D-IQN17, the phage clones were added to wells of 96-well plates coated as above 
with D-INQ17, D-GCN4-pI Q I (with the three mutations), D-IQN17(G39W = 
glycine36 substituted with tryptophan), or wells with no target. The phage were 

15 incubated on the plates and washed for the same lengths of time as in the round from 
which they were identified. Eluted phage were used to infect K91 kan cells and the 
recovered transducing units were determined as above. These sequences bound 
specifically to the wells with D-IQN17. 

Peptide Purification 

!0 IQN 1 7 and the D 1 0 peptides were synthesized by FMOC peptide chemistry. 

They have an acetylated N-terminus and a C-terminal amide. IQN 17 contains 29 
residues derived from GCN4-pI Q I on the N-terminus and 17 residues from the C- 
terminus of N36 on the C-terminus. There is one residue overlap between GCN4- 
plpl and the N36 region, making the peptide 45 residues long. To improve 

5 solubility, three amino-acid substitutions were made in the GCN4-pIqI region of 
IQN17, as compared to the original GCN4-pI Q I sequence (Eckert, D.M. et al., J. 
Mol. Biol, 284:259-865 1998). These substitutions are L13E, Y17K, and H18K. 
Thus, the sequence of IQN7 is: 

ac-RMKQIEDKIEEIESKQKKIENEIARIK KI J ,QT .TVWGTKQI .QAR TT ,-am 
0 (ac- represents an N-terminal acetyl group and -am represents a C-terminal amide), 
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with the HIV portion underlined. For mirror-image phage display, IQN17 was 
synthesized using D-amino acids (for amino acid residues that contain a second 
chiral center, such as He and Thr, the exact mirror image of the naturally occurring 
amino acid residue is used to create the D-version of the target). In addition, the N- 
5 terminus of the peptide was biotinylated using NHS-LC-biotin II (Pierce, catalog 
#21336). Between the biotin and the IQN1 7 sequence was a three amino acid linker 
of GKG, with the lysine in the naturally-occurring L-form. This lysine was inserted 
as a trypsin recognition site. 

The sequences of the D-peptides are as follows (with all amino acids in the 

1 0 D-enantiomer, using the exact mirror image of naturally occurring amino acid 
residues for He and Thr, which contain a second chiral center): 
DlOpepl : Ac-GACEARHREWAWLCAA-CONH 2 (SEQ ID NO: 34); 
D10pep3: Ac-KKGACGLGQEEWFWLCAA-CONH 2 (SEQ ID NO: 15); 
D10pep4: Ac-GACDLKAKEWFWLCAA-CONH 2 (SEQ ID NO: 35); 

15 DIOpepS: Ac-KKGACELLGWEWAWLCAA-CONH 2 (SEQ ID NO: 16); 
D10pep6: Ac-GACSRSQPEWEWLCAA-CONH 2 (SEQ ID NO: 36); 
D10pep7: Ac-GACLLRAPEWGWLCAA-CONH 2 (SEQ ID NO: 37); 
DlOpeplO: Ac-KKGACMRGEWEWSWLCAA-CONH 2 (SEQ ID NO: 18); and 
D10pepl2: Ac-KKGACPPLNKEWAWLCAA-CONH 2 (SEQ ID NO: 19). 

20 After cleavage from the resin, the peptides were desalted on a Sephadex G- 

25 column (Pharmacia) and lyophilized. The lyophilized peptides were purified by 
reverse-phase high performance liquid chromatography (Waters, Inc.) on a Vydac 
CI 8 preparative column. The D-peptides were then air-oxidized by dissolving the 
lyophilized powder in 20 mM Tris, pH 8.2, and stirring at room temperature for 

25 several days. The oxidized peptides were HPLC purified as before. The expected 
molecular weights of the peptides were verified using MALDI-TOF mass 
spectrometry (PerSeptive Biosystems). Peptide concentrations were determined 
using tyrosine, tryptophan and cysteine absorbance at 280 nm in six molar GuHCl 
(Edelhoch, 1967). Peptide stock solutions were prepared in DMSO. 

30 The N-terminal lysines on D10pep3, D10pep5, D10pep7a, DlOpeplO and 
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D 1 Opep 1 2 were added to increase the water solubility of the peptides. To 
investigate the effect of the added lysines on the inhibitory activity of the peptides, 
DlOpepl was synthesized with two N- terminal lysines (denoted Dl Opep la) and 
compared to DlOpepl without lysines: DIOpepla was found to have an IC 50 for 
5 inhibition of syncytia formation approximately 2-fold higher than DlOpepl (i.e., 
without lysines). In addition, D10pep5 was synthesized with two additional N- 
terminal lysines (for a total of four lysines to generate a peptide denoted D10pep5a). 
The IC 50 for inhibition of syncytia formation of D10pep5a was approximately 2-fold 
higher than D10pep5. The addition of N-terminal lysine residues to the D-peptides 
1 0 results in only a modest decrease of inhibitory activity. 

D-peptides that had additional D-Lys residues added to the N-termini, that 
were synthesized for study are indicated with the addition of "a" to the peptide name 
and include the following: 

DIOpepla: Ac-KKGACEARHREWAWLCAA-CONH 2 (SEQ ID NO: 38); 
15 D 1 0pep4a: Ac-KKGACDLKAKEWF WLC AA-CONH 2 (SEQ ID NO: 39); 
D 1 0pep5a: Ac-KKJCKGACELLG WEWAWLCAA-CONH 2 (SEQ ID NO: 1 7); 
D 1 0pep6a: Ac-KKGACSRSQPEWE WLCAA-CONH 2 (SEQ ID NO: 40); and 
D10pep7a: Ac-KKGACLLRAPEWGWLCAA-CONH 2 (SEQ ID NO: 41). 

These sequences are also represented in Figure 3. The 12 amino acid "core" of each 
20 D-peptide (which, in turn comprises a 10-mer and the consensus sequences 
described herein) are as follows: 

CDLKAKEWFWLC (SEQ ID NO: 3) 

CEARHREWAWLC (SEQ ID NO: 4) 

CELLGWEWAWLC (SEQ ID NO: 5) 

25 CLLRAPEWGWLC (SEQ ID NO: 6) 

CSRSQPEWEWLC (SEQ ID NO: 7) 

CGLGQEEWFWLC (SEQ ID NO: 8) 

CMRGEWEWSWLC (SEQ ID NO: 9) 

CPPLNKEWAWLC (SEQ ID NO: 10) 

30 CVLKAKEWFWLC is an alternative sequence for peptide SEQ ID NO: 3. 

(SEQ ID NO: 11). 
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It is readily apparent that there is a highly conserved consensus sequence in these 
peptides. The 12 amino acid peptide represented in Figure 3 can be represented as: 
CXXXXXEWXWLC (SEQ ID NO: 12), where amino acid residues common to the 
peptides are shown and X represents an amino acid residue which is not conserved 
5 among the peptides. 



Example 4 Assessment of Activity of C34 Peptides and D-Peptides 

The potency of C34 peptides in inhibiting viral infection and the HIV-1 
infection inhibitory activity of the D-peptides were assayed using recombinant 
luciferase-expressing HIV-1 (Chen, B.K. etalj. Virol., 68:654 (1994); 

10 Malashkevich, V.N., et al Proa Natl Acad. ScL, USA, 95:9134 (1998)). The virus 
was produced by co-transfecting an envelope-deficient HIV genome NL43LucR-E- 
(Chen, B.K. et al., J. Virol , 68:654 (1994) and the HXB2 gpl60 expression vector 
pCMVHXB2gpl60 (see Chan, D.C. et al, Proc. Natl Acad. ScL, 95:1 1513 (1998)) 
into 293T cells. Low-speed centrifiigation was used to clear the viral supematants 

1 5 of cellular debris. The supernatant was used to infect HOS-CD4/Fusion cells (N. 
Landau, NIH AIDS Reagent Program) in the presence of the D-peptides, with 
concentrations ranging from 0 to 500 ^M. Cells were harvested 48 hours post- 
infection, and luciferase activity was monitored in a Waliac AutoLumat LB953 
luminometer (Gaithersburg, MD). The IC 50 is the peptide concentration that results 

20 in a 50% decrease in activity relative to control samples lacking peptide. The IC 50 
was calculated from fitting the data to a Langmuir equation [y=k/(l+([peptide]/IC 50 ) 
+ x], where y = luciferase activity and k and x are scaling constants. 



Cell/Cell Fusion Assay 

Inhibition of cell/cell fusion (i.e., syncytia formation) was assayed by co- 
25 culturing Chinese hamster ovary cell expressing HXB2 envelope (K. Kozarsky, et 
al, I Acquis Immune. Defic. Syndr., 2:163 (1989) and the HeLa-CD4-LTR-Beta- 
gal cells (M. Emerman, NIH AIDS Reagent program) in the presence of varying 
concentration of peptide. When mixed, these cells form syncytia, or multi-nucleated 
cells, which express P-galactosidase. Approximately twenty hours after co-culturing 
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the cells, the monolayers were stained with 5-bromo-4-chloro-3-indolyl-P-D- 
galactoside to visualize the syncytia. The syncytia are visualized with a microscope 
and counted manually (a syncytia is scored as a fused cell containing three or more 
nuclei). The IC 50 was calculated from fitting the data to a Langmuir equation [y = 
5 k/(l + [peptide]/IC 50 ) + x], where y = number of syncytia and k and x are scaling 
constants. 



Table 1 Stability of mutant N36/C34 complexes and the inhibitory potency 

of C34 mutants. 



Peptide 


T ra (°Q 


IC J0 (nM) viral entry 


IC J0 (nM) cell fusion 










Wildtype 


66 


2.1 ±0.31 


0.55 ± 0.03 


















Trp <28 ^Ala 


53 


10 ±2.0 


3.8 ±0.33 


Trp <31 _Ala 


37 


61 ± 16 


15 ± 0.82 


Ile 635 -+Ala 


55 


4.1 ±0.91 


0.96 ±0.12 










Control 
residues 








Met 629 -Ala 


66 


2.0 ± 0.27 


0.74 ± 0.03 


Arg 633 -Ala 


65 


2.6 ±0.89 


0.76 ± 0.07 



Mutant C34 peptides (10 nM) were complexed with the N36 peptide (10 
^iM) in phosphate-buffered saline (pH 7.0) for circular dichroism (CD) 
measurements. The apparent melting temperatures (T m ) were estimated from the 
thermal dependence of the CD signal at 222 nm. Inhibition of viral entry was 
5 measured in a cell-culture infection assay using recombinant luciferase-expressing 
HIV-1 . Inhibition of cell-cell fusion was measured in a syncytium assay. The 
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means and standard errors are from triplicate trials. 

Similarly, the activity of the D-pep tides described was assessed using the 
two assays described above. Results are shown in figures 6A-6B and 8A-8B. 



Example 5 : Crystallization of the IQ 1 7/D 1 Opep 1 Complex and Ligand-Free 
5 IQN17 

Peptide Purification, Crystallization 

Peptides IQN17 and Dl Opep 1 were synthesized by FMOC peptide 
chemistry, as described above. 

A 10 mg/ml stock of a mixture of IQN17 and DlOpepl was prepared in 

10 water. The final concentration of IQN17 was about 1.37 nM, and the final 

concentration of DlOpepl was about 1.51 mM. Initial crystallization conditions 
were found using Crystal Kits I and II (Hampton Research), and then optimized. To 
grow the best diffracting crystals, one microliter of this stock was added to one 
microliter of the reservoir buffer (10% PEG 4000, 0.1 M NaCi pH 5.6, 20 % 2- 

1 5 propanol) and allowed to equilibrate against the reservoir buffer. Crystals belong to 
a space group P321 (a=b=41.83A; c=84.82A, a=P=90°, y=120°) and contain one 
IQN1 7/D 1 Opep 1 monomer in the asymmetric unit. A useful osmium derivative was 
produced by increasing the concentration of PEG 4000 in the reservoir solution by 
4%, adding (NH 4 ) 2 0sCl 6 to the reservoir solution to a final concentration of 5 mM 

20 and adding five microliters of the resulting solution to the drop containing the 
protein crystal. Prior to data collection native and heavy-atom derivative crystals 
were transferred into cryosolution containing 20% PEG 4000, 0.1 M NaCi PH 5.6, 
20% 2-Propanol and flash-frozen using X-stream cryogenic crystal cooler 
(Molecular Structure Corporation). 

25 The best diffracting crystals of ligand-free IQN1 7 were grown with a similar 

technique as above: on microliter of a 10 mg/ml solution of IQN1 7 in water was 
added to one microliter of the reservoir buffer (1 .0 M K,Na Tartrate, 0. 1 M 
NaHEPES pH 7.0) and allowed to equilibrate against the reservoir buffer. Before 
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flash freezing, the crystals were transferred into buffers consisting of the reservoir 
solution with increasing amounts of glycerol, up to a final concentration of 23% 
glycerol. Crystals belong to the space group C222, (a= 57.94 A, b=121.96 A, c= 
73.67 A; a=P=y=90°) and contain one IQN17 trimer in the asymmetric unit. 

5 X-Ray Data Collection and Processing 

Initial data were collected on a Rigaku RU300 rotating-anode x-ray generator 
mounted to an R-axis IV area detector (Molecular Structure Corporation). 
Diffraction data for IQN17 .were collected at 100 K using a Quantum-4 CCD 
detector and the 5.0.2 beamline at the Advanced Light Source (Berkeley, USA). 

10 Final native and multiwavelength anomalous diffraction (MAD) data for 

IQN17/D10pepl were collected at the Howard Hughes Medical Institute Beamline 
X4A at Brookhaven National Laboratory using a Raxis-IV detector. For MAD 
data, four wavelengths near the osmium L-III absorption edge were selected based 
on the fluorescence spectrum of the Os derivative crystal (Table 2). The four 

15 wavelengths were: 1.1398 A, 1.1403 A, 1.1393 A, 1.1 197 A. Datasetswere 

collected in 20° batches, allowing the same batch to be collected at each wavelength 
before moving to the next batch, in order to minimize the crystal decay between data 
sets. Reflections were integrated and scaled with the programs DENZO and 
SCALEPACK (Otwinowski, Z., (1993) in Data Collection and Processing, eds. 

20 Sawer, L., Isaacs, N. & Bailey, S. (SERC, Daresbury Laboratory, Warrington, 
England), pp. 55-62). 

Further diffraction data processing, phase determination and map 
calculations were performed using the CCP4 suite of programs (CCP4, Acta Cryst. 
Z)50;76O-763 (1994)). Intensities were reduced to amplitudes with the program 

25 TRUNCATE, and the data sets for the wavelengths closest to the Os L-III 

absorption edge (Al t A2, A3) were scaled with SCALEIT to the remote wavelength 
(A4) data set (Table 2). 

Phase Determination and Crystallographic Refinement 

Initially, phase determination for IQN17/D10pepl crystals was attempted 
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with the molecular replacement technique using the theoretical model of IQN17 
build from the published GCN4-pI Q I and HIV gp41 structures (Eckert, D.M., et al 
(1998) J. Mol Biol 254:859-865; Chan, D.C, et al (1997) Cell 89, 263-273) with 
sidechains truncated to a polyserine chain. The resulting molecular replacement 
5 solutions were ambiguous and the electron density map did not reveal conformation 
of the DlOpepl peptide. The molecular replacement phases were good enough, 
however, for determining the coordinates of a single Os atom in the corresponding 
derivative using difference and anomalous fourier maps. The heavy atom binds on 
the cryallographic three-fold axis (0.333, 0.667, 0.047). MAD phases were then 

10 generated with the program MLPHARE (Table 2) and extended to higher resolution 
with the program DM. The quality of MAD electron density map at 1.5 A resolution 
was exceptional, and revealed structural details of IQN17 and DlOpepl peptide with 
clarity. Electron density map interpretation and model building was done with the 
program O (Jones, T.A. et al (1991) Acta Crystallogr. D47, 110-119). The structure 

15 of IQN17-D10pepl complex was refined using the program CNS (Briinger, A.T. et 
al, Acta Crystallogr, D54, 905-921 (1998)). The correctness of the structure was 
checked with simulated annealing omit maps and with the program WHAT CHECK 
(Hoff, R.WW. et al, Nature 381: 272 (1996)). All residues of IQN17 and the 
DlOpepl peptide (when converted into its mirror image) occupy most preferred 

20 areas of the Ramachandran plot. The conformations of the majority of the residues 
are well defined except for the two most N-terminal residues of IQN17 and the side 
chains of Arg-6 and Arg-8 of the DlOpepl peptide. 

The structure of iigand-free IQN17 was solved by molecular replacement 
using the program AMORE (Navaza, J. (1994) Acta Crystallogr. A50, 157-163) and 

25 the IQN17 part of the refined IQN17/D10pepl structure as a test model. Three-fold 
noncrystallographic averaging, solvent flattening and histogram matching with the 
program DM was used for phase improvement. Electron density map interpretation 
and model building was done with the program O (Jones et al., Acta Crystallogr. 
D54, 905-921 (1991). The structure of the IQN17/D10pepl complex was refined 

30 using the program CNS (Brunger, A.T. et al, Acta Crystallogr. D54, 905-921 
(1998)). 
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The crystal structure can be used to design more effective and/or new D- 
peptides, peptidomemetics or other small molecules that inhibit HIV infectivity. 

Example 6 Nuclear magnetic resonance (NMR) methods for identifying 

compounds which bind to the N-helix hydrophobic pocket of gp41 

5 A. Assaying specific binding between the IQN1 7 hydrophobic pocket and D- 
peptides 

NMR experiments were used to assay the binding of each D-peptide to 
IQN17. The single tryptophan residue of IQN17 (denoted Trp-571) provides an 
excellent probe of specific binding to the hydrophobic pocket of gp41 . In deuterium 

10 oxide (deuterated water) buffers, the simple homonuclear one-dimensional *H NMR 
spectrum of IQN17 (Figure 9A, middle) shows five signals from the Trp-571 indole, 
extremely well-resolved from all other signals in the molecule. To test a compound 
for binding to the gp41 pocket, two one-dimensional ! H NMR measurements were 
made on samples in deuterated buffers. First, a reference (control) spectrum of 

15 IQN17 was taken, identifying the Trp571 chemical shifts in the unbound form. A 
second spectrum was acquired on a sample containing both IQN17 and the 
compound in question. An optional third specthim of the D-peptide (or other small 
molecule, or mix of molecules) was also taken. ] H NMR experiments were 
performed on a Bruker AMX 500 spectrometer. Data was processed in Felix 98.0 

20 (MSI) on Silicon Graphics computers, and all spectra were referenced to DSS. All 
experiments were performed at 25° C in 100 mM NaCl, 50 mM sodium phosphate 
(pH 7.5). All buffers used were >99.7% D 2 0, to remove overlapping resonances 
from exchangeable backbone and side chain protons. Solute concentrations ranged 
from 0.3-1.0 mM for individual peptides, 0.8-1.0 mM for 1:1 commplexes of IQN17 

25 with each D-peptide. 

Simple binding of two or more components is expected to result both in 
broader peaks (due to the increased size of the complex) and in changes in chemical 
shifts (due to the different chemical environments experienced by nuclei in free and 
bound forms). Specific binding to the hydrophobic pocket is indicated by a change 
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in the Trp-571 chemical shifts, as well as by a broadening of peaks. Binding can 
also be indicated by similar changes in the chemical shifts and peak widths of the 
molecule (peptides and small organic molecules, for example) assayed. Figure 9A 
shows an example of these effects: the NMR spectrum of the IQN17/D10pepla 
5 complex displays broader peaks and dramatically different chemical shifts than the 
spectra for either of the two separate components. All IQN17/D-peptide complexes 
studied gave similar results, though varying in the degree of chemical shift 
dispersion (Figure 9B). Thus, binding was indicated in all cases. 

The x-ray crystallographic finding that the two conserved Trp residues, and 

10 the conserved Leu residue, in DlOpepl are directly involved in the binding of the 
IQN17 pocket, strongly suggests that these conserved residues participate in a 
similar manner when the other D-peptides bind the pocket. These conserved 
trypophan residues, and Trp-571 of IQN17, provide an opportunity to study the 
binding interfaces in greater detail. In the IQN17/D10pepl crystal structure, the 

15 Trp-571 sidechain of IQN17 is in close contact with Trp-10 of DlOpepl, with 

several protons of Trp-571 (H C2 , H^ 2 , H C3 , H €3 ; the four scalar-coupled protons of the 
aromatic ring) above the plane of the Trp-10 indole group. In this position, aromatic 
ring current interactions (F.A. Bovey, Nuclear Magnetic Resonance Spectroscopy 
(1988)) are expected to alter the chemical shifts of some of those protons, moving 

20 peaks upfield in the manner seen (Figure 9A, bottom). Use of the structure-based 
chemical shift prediction program SHIFTS (version 3.0b2, K. Osapay, D. Sitkoff, D. 
Case) also predicted that only protons from Trp-571 will experience a large upfield 
shift, expecially the H C3 proton. If the other D-peptides bind to the IQN1 7 pocket in 
the same fashion as DlOpepl, a similar juxtaposition of Trp-571 and Trp-10 should 

25 occur, resulting in upfield-shifted peaks. All of the D-peptide/IQN17 complexes 
studied displayed such peaks, though varying in the extent of the shift (Figure 9B). 
The DlOpepl complex showed the most extreme upfield shifts, and the D10pep7a 
complex the least. The magnitude of these changes is very large, ranging from 
roughly 0.5 to 2 ppm for the most upfield-shifted proton (H (3 , in all cases where it 

30 could be assigned). In comparison, chemical shift differences often used to detect 
binding in SAR by NMR experiments (Shuker, S.B., Hajduk, P J., Meadows, R.P., 



WO 00/06599 



PCT/US99/17351 



-64- 

Fesik, S.W., Science 274:1531-1534 (1996)) are frequently in the range of 0.05 to 
0.2 ppm.) Though a broad range of upfield chemical shifts was observed, ring- 
current effects can be highly sensitive to distance and orientation, so that small 
structural differences may give rise to substantial variations in chemical shift. (All 
5 of the upfield shifts observed are consistent with the approximate orientation of Trp 
side chains expected from the x-ray crystal structure.) Also, the upfield-shifted 
peaks are somewhat broadened compared to others in these NMR spectra (most 
likely due to some type of exchange process) an effect particularly pronounced for 
the complexes with D10pep5a and with D10pep7a. 

10 To confirm that the strongly upfield-shifted peaks all correspond to a single 

sidechain (almost certainly Trp-571), two-dimensional NMR (TOCSY) experiments 
were performed on each of the IQN17/D-peptide complexes. As expected, the 
TOCSY experiments indicate that in each complex, the strongly upfield-shifted 
resonances all belong to the same aromatic side chain, identified as a group of four 

1 5 scalar-coupled protons. One example TOCSY spectrum is shown in Figure 9C. For 
several of the complexes studied, NOESY experiments also indicate contact between 
this sidechain and other (unassigned) aromatic groups, as expected from the 
IQN17/D10pepl structure. Not all of the potential NOE crosspeaks could be 
resolved, due to intense spectral overlap in the 6.8-7.6 ppm region. 2D NOESY and 

20 TOCSY experiments as described in J. Cavanaugh, W. J. Fairbrother, A.G. Palmer, 
N.J. Skelton, Protein NMR Spectroscopy: Principles and Practice (1996) were 
performed on samples of IQN17 and of each complex, with mixing times ranging 
between 30-90 ms (NOESY) and 30-70 ms (TOCSY). Spectral widths of 1 1 , 1 1 1 Hz 
and 5555 Hz were used in the acquisition (t 2 ) and indirect (t,) dimensions, 

25 respectively. TOCSY experiments employed the DIPSI-2rc mixing sequence (J. 
Cavanaugh, M. Ranee, J. Magn. Reson. Serv. A, 705:328 (1993)). 

We conclude that all D-peptides assayed clearly bind the hydrophobic pocket 
of IQN17. Additionally, in the majority of these IQN17 complexes (i.e., DlOpepl, 
D10pep3, D10pep4, D10pep6, DlOpeplO, and D10pepl2) the D-peptides contact 

30 the pocket with very similar binding interfaces, bringing Trp-571 in close contact 
with the aromatic ring of Trp- 10. In the cases of complexes with D10pep5a and 
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D10pep7a this conclusion also seems very likely, although the more limited 
chemical shift dispersion and broader peaks raise a remote possibility of some other 
mode of binding. 

The binding assay employed here can also be employed to assay binding of 
5 other molecules to the hydrophobic pocket of gp41 (e.g., such as found in IQN1 7). 
The assay is especially easy to interpret in a case where an aromatic group binds the 
pocket, as with the set of D-peptides described above. However, any pocket-binding 
molecules should also perturb the chemical shifts of Trp-571, an easily noticeable 
effect. In addition, new NMR signals generated by the small molecules themselves 

1 0 upon binding, are also indicative of binding. 

The use of one-dimensional homonuclear *H NMR provides significant 
advantages over multidimensional heteronuclear NMR to determine specific 
binding: (1) Sensitivity is higher, allowing samples to be assayed more quickly; 
alternately the higher sensitivity makes possible the use of lower concentrations of 

15 IQN17 and of putative binding agents, allowing screening for higher-affinity 

compounds, and more of them simultaneously. (2) Non-isotopically labeled proteins 
are simpler to produce, and more cost-effective. However, two-dimensional NMR 
experiments, either homonuclear or heteronuclear (with 15 N and/or l3 C isotopic 
labeling) could also be employed. 

20 B. Screening chemical libraries 

The binding assay described in (A) above can be used to screen large 
numbers of compounds present in a chemical library. Simple one-dimensional 
homonuclear ! H NMR experiments are sufficient to assess binding, with no 
requirement for isotopic labeling. Two-dimensional NMR experiments, either 

25 homonuclear or heteronuclear (with l5 N and/or I3 C isotopic labeling) could also be 
employed. Single compounds can be screened one at a time in this process. 
However, multiple compounds can also be combined in the same assay with IQN1 7 
(or any representation of the gp41 N-helix coiled coil) and screened simultaneously. 
Binding to the pocket by any component of the mixture is indicated by a change in 

30 the Trp-571 chemical shifts. NMR signals from a large number of compounds 
together have the potential to obscure signals from Trp-571 ; these signals from 
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unbound molecules can be eliminated using pulsed field gradient techniques well 
known in the art. With use of these techniques and a commercially available NMR 
tube sample changer, the automated screening of large numbers of compounds is 
straightforward. 

5 C. Evaluating the products of multiple combinatorial syntheses 

The screening process described in (B) above can also be extended to take 
advantage of combinatorial organic synthetic methods. Such methods are currently 
being used to generate whole families of compounds, with each family containing a 
diverse number of chemically related compounds. By the simple assay described 

10 above, the products of an entire combinatorial synthesis can be screened 

simultaneously. If no binding is indicated, then there is no need to invest further 
attention in any member of that family of compounds. If binding is indicated, then a 
particular family of promising compounds can be targeted for more detailed 
investigation. Simple one-dimensional homonuclear 'H NMR experiments are 

15 sufficient to assess binding, with no requirement for isotopic labeling. Two- 
dimensional NMR experiments, either homonuclear or heteronuclear (with 15 N 
and/or 13 C isotopic labeling) could also be employed. 
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Table 2. Data collection and refinement statistics 



Data collection 


Crystal 


X(A) 


Completeness (%) 


Rsvm 1 (%) 


Resolution (A) 


IQN17 


1.0000 


89.5 


3.7 


2.1 


IQN17/D10 


1.1197 


93.8 


4.8 


1.5 


Os X1 


1.1403 


98.6 


6.3 


2.0 


Os X2 


1.1399 


96.8 


9.7 


2.0 


Os X3 


1.1393 


96.9 


7.9 


2.0 


Os X4 


1.1197 


97.0 


8.4 


2.0 



MAD phasing statistics (22.0-2.0 A) 







Rcullis 3 


Rcullis 3 


Rcullis 3 


Ph. Power 4 


Ph. Power 4 


Occ. 5 


Anom. 


Derivative 


Riso 2 (%) 


Acentric 


Centric 


Anom. 


Acentric 


Centric 


Occ. 5 


Os X1 vs. X4 


7.3 


0.75 


0.61 


0.47 


1.41 


1.21 


-0.039 


0.337 


Os X2vs. X4 


5.2 


0.83 


0.71 


0.44 


1.04 


1.15 


-0.027 


0.533 


Os X3vs. X4 


3.3 


0.97 


0.97 


0.49 


0.35 


0.28 


-0.005 


0.295 



Overall figure of merit (before solvent flattening): 0.68 



Refinement statistics 





Crystal 


Non-hydrogen 
protein atoms 


Water 
s 


Ions 


Resolution 
(A) 


Reflections 
total 


Rcryst 6 


Rfree 6 


R.m.s. deviations 
bonds (A) angles 
(°) 


IQN17/D10 


516 


150 


1 


10.0-1.5 


13549 


0.214 


0.245 


0.012 1.498 


IQN17 


1143 


160 


1 


5.0 - 2.5 


7541 


0.282 


0.352 


0.009 1.252 



^sym = -^j|lj-<i>! ' — j|<M. where Ij is the recorded intensity of the reflection j and <i> is the mean 
recorded intensity over multiple recordings. 

2 Riso * 3l F (Xi) ± F(X4)I * F(Xi)ll ' -|F(X4)|. where F ( xi) is the structure factor at wavelength Xi and F<X4) is 
the structure factorlit the reference wavelength X4. 

3 Rcullis = ^F(/J) ± P(X4)| - |F h (Xi),cll / - i F (Xi )= F (X4)|. where r h(Xi)tC is the calculated heavy atom structure 
factor. 

4 Phase power = <Ph(/j)> ' £ . where <Fh(;j)> is the root-mean-square heavy atom structure factor and E 
is the residual lack of closure error. 
5 Occupancies are values output from MLPHARE. 

6 Rcryst. free = -iFoDsi - Fcaicll ' Fobs!/ where the crystallographic and free R factors are calculated using 
the working and test sets, resoectively. Test set contained 10% of reflections. 
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While this invention has been particularly shown and described with 
references to preferred embodiments thereof, it will be understood by those skilled 
in the art that various changes in form and details may be made therein without 
departing from the spirit and scope of the invention as defined by the appended 
5 claims. 
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CLAIMS 

What is claimed is: 

1 . A peptide which comprises^ soluble, trimeric form of a coiled-coil and a 
sufficient portion of the N-peptide region of HIV gp41 to comprise the 

5 amino acid residues which form the pocket of the N-helix coiled-coil of HIV 

gp41. 

2. The peptide of Claim 1 wherein the peptide is a D-peptide. 

3. The D-peptide of Claim 2 wherein the coiled coil is selected from the group 
consisting of: 

1 0 (a) the coiled coil of GCN4-pI Q I; 

(b) the coiled coil of GCN4-pII; 

(c) the coiled coil of Moloney Murine Leukemia Virus; and 

(d) the coiled coil of ABC heterotrimer. 

4. The D-peptide of Claim 3 wherein the amino acid sequence of the coiled coil 
15 is: 

RMKQIEDKIEEIESKQKKIENEIARIKK (SEQ ID NO: 25). 

5. The D-peptide of Claim 2 wherein the sufficient portion of the N peptide 
region of HIV gp41 comprises the sequence: LLQLTVWGIKQLQARIL 
(SEQ ID NO: 20). 

20 6. The D-peptide of Claim 5 which is IQN17 (SEQ ID NO: 2). 

7. A D-peptide which is a soluble, trimeric peptide model of the HIV gp41 

hydrophobic pocket, wherein the D-peptide comprises SEQ ID NO: 25 and a 
sequence which comprises 17 amino acid residues, wherein the 17 amino 
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acid residues comprise the sequence: LLXLTVWGXKXLQXRXX (SEQ ID 
NO: 42), wherein L, T, V, W, G, K, Q and R are amino acid residues 
represented by the single letter amino acid code and X is any D-amino acid 
residue. 

5 8. The D-peptide of Claim 7 wherein the sequence which comprises 1 7 amino 
acid residues is selected from the group consisting of: SEQ ID NO: 20; SEQ 
ID NO: 26; SEQ ID NO: 27 and SEQ ID NO: 42. 

9. A D-peptide selected from the group consisting of: 





(a) 


CDLKAKEWFWLC (SEQ ID NO: 3); 


10 


(b) 


CEARHREWAWLC (SEQ ID NO: 4); 




(c) 


CELLGWEWAWLC (SEQ ED NO: 5); 




(d) 


CLLRAPEWGWLC (SEQ ID NO: 6); 




(e) 


CSRSQPEWEWLC (SEQ ID NO: 7); 




(f) 


CGLGQEEWFWLC (SEQ ID NO: 8); 


15 


(g) 


CMRGEWEWSWLC (SEQ ID NO: 9); 




GO 
(i) 


CPPLNKEWAWLC (SEQ ID NO: 10); 
CVLKAKEWFWLC (SEQ ID NO: 11); 




0) 


KKGACGLGQEEWFWLC (SEQ ID NO: 15); 




00 


KKGACELLGWEWAWLC (SEQ ID NO: 16); 


20 


(1). 


KKKKGACELLGWEWAWLC (SEQ ID NO: 1 7); 




(m) 


KKGACMRGEWEWSWLC (SEQ ID NO: 18); 




(n) 


KKGACPPLNKEWAWLC (SEQ ID NO: 19); 




(o) 


a D-peptide comprising WXWL (SEQ ID NO: 23); 




(P) 


a D-peptide comprising EWXWL (SEQ ED NO: 24); 


25 


(q) 


a D-peptide comprising CXXXXXEWXWL (SEQ ED NO: 12) 




(r) 


ac-GACEARHREWAWLCAA-am (SEQ ID NO: 34); 




(r) 


ac-KKGACEARHREWAWLCAA-am (SEQ ID NO: 38); 




(t) 


ac-FCKKKGACEARHREWAWLCAA-am (SEQ ED NO: 43); 




(u) 


ac-GACGLGQEEWFWLCAA-am (SEQ ID NO: 44); 
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(v) ac-KKGACGLGQEEWFWLCAA-am (SEQ ID NO: 1 5); 

(w) ac-KKKKGACGLGQEEWFWLCAA-am (SEQ ED NO: 45) 

(x) ac-GACDLKAKEWFWLCAA-am (SEQ ED NO: 35); 

(y) ac-KKGACDLKAKEWFWLCAA-am (SEQ ED NO: 39); 

(z) ac-KKKKGACDLKAKEWFWLCAA-am (SEQ ID NO: 46); 

(a') ac-GACELLGWEWAWLCC-am (SEQ ID NO: 47); 

(b') ac-KKGACELLGWEWAWLCAA-am (SEQ ED NO: 1 6); 

(c') ac-KKKKGACELLGWEWAWLCAA-am (SEQ ED NO: 1 7); 

(d') ac-GACSRSQPEWEWLCAA-am (SEQ ID NO: 36); 

(e') ac-KKG ACSRS QPEWEWLC AA-am (SEQ ID NO: 40); 

(f ) ac-KKKKGACSRSQPEWEWLCAA-am (SEQ ID NO: 48); 

(g') ac-GACLLRAPEWGWLCAA-am (SEQ ID NO: 37); 

(h') ac-KKGACLLRAPEWGWLCAA-am (SEQ ID NO: 41); 

(i') ac-KKKKG AC LLRAPE WGWLCAA-am (SEQ ID NO: 49); 

(j ') ac-GACMRGEWEWSWLCAA-am (SEQ ED NO: 50); 

(k') ac-KKGACMRGEWEWSWLCAA-am (SEQ ED NO: 18); 

(1») ac-KKKKGACMRGEWEWSWLCAA-am (SEQ ID NO: 51); 

(m') ac-GACPPLNKEWAWLCAA-am (SEQ ID NO: 52); 

(n') ac-KKGACPPLNKEWAWLCAA-am (SEQ ID NO: 19); 

(o') ac-KKKKGACPPLNKEWAWLCAA-am (SEQ ID NO: 53); 

(p') ac-GACXXXXXEWXWLCAA-am (SEQ ID NO: 54); 

(q') ac-KKGACXXXXXEWXWLCAA-am (SEQ ID NO: 55); 

(r') ac-KKKKGACXXXXXEWXWLCAA-am (SEQ ID NO: 56); 

(s') ac-XXCXXXXXEWXWLCXX-am (SEQ ID NO: 57); 

(f) ac-KKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 58); 

(u') ac-KKKKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 59); 

(v') ac-XXCXXXXXEWXWLCXXX-am (SEQ ID NO: 60); 

(w') ac-KKXXCXXXXXEWXWLCXXX-am (SEQ ED NO: 61); 

(x') ac-KKKKXXCXXXXXEWXWLCXXX-am (SEQ ID NO: 62); and 

(y') a variant of a sequence of (a) through (x'), wherein the variant binds 
the N-helix coiled-coil cavity of HEV gp41, wherein ac- at the C- 
terminus and -am at the N-terminus are optional. 
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10. The peptide of Claim 1 wherein the peptide is an L-peptide. 



1 1 . The L-peptide of Claim 10 wherein the soluble, trimeric coiled-coil is 
selected from the group consisting of: 

(a) the coiled coil of GCN^pIgl; 
5 (b) the coiled coil of GCN4-pII; 

(c) the coiled coil of Moloney Murine Leukemia Virus; and 

(d) the coiled coil of ABC heterotrimer. 

12. The L-peptide of Claim 1 0 wherein the sufficient portion of the N peptide 
region of HIV gp41 comprises the sequence: LLQLTVWGIKQLQARIL 

10 (SEQ ID NO: 20). 

13. The L-peptide of Claim 12 which is IQN17 (SEQ ID NO: 2). 

14. An L-peptide which is a soluble, trimeric model of the HIV gp 1 hydrophobic 
pocket, wherein the L-peptide comprises SEQ ID NO: 25 and a sequence 
which comprises 17 amino acid residues, wherein the 17 amino acid residues 

1 5 comprise the sequence: LLXLTVWGXKXLQXRXX, wherein L, T, V, W, 

G, K, Q and R are amino acid residues represented by the single letter amino 
acid code and X is any D-amino acid residue. 

15. The L-peptide of Claim 14 wherein the sequence which comprises 1 7 amino 
acid residues is selected from the group consisting of: SEQ ID NO: 20; SEQ 

20 ID NO: 26; and SEQ ID NO: 27. 



1 6. A method of identifying a drug that interferes with formation of a complex 
between C34 peptide and N36 peptide, comprising: 
(a) combining a candidate drug to be assessed for its ability to interfere 
with formation of a complex between C34 peptide and N36 peptide, 
25 C34 peptide and N36 peptide, under conditions appropriate for 
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formatin of a complex between C34 peptide and N36 peptide, thereby 

forming a test sample; and 
(b) determining whether formation of a complex between C34 peptide 

and N36 peptide is inhibited, 
5 wherein if formation of the complex is inhibited, the candidate drug is a drug 

that interferes with formation of the complex whereby a drug that interferes 
with formation of the complex is identified. 

17. The method of Claim 16 wherein a control sample is formed by combining 
C34 peptide and N36 peptide, under the same conditions as the conditions 

1 0 under which the test sample is formed in (a); formation of a complex 

between C34 peptide and N36 peptide is determined and the extent to which 
the complex is formed in the test sample is compared with the extent to 
which the complex is formed in the control sample, wherein if the complex is 
formed to a lesser extent in the test sample than in the control sample, the 

1 5 candidate drug is a drug that interferes with formation of the complex, 

whereby a drug that interferes with formation of the complex is identified. 

18. The method of Claim 1 6 wherein C34 peptide and N36 peptide are each 
labeled by a member of a pair of donor-acceptor molecules and the extent to 
which formation of a complex between C34 and N36 occurs is assessed by 

20 determining the extent to which light emission occurs from the acceptor 

molecule, wherein if light emission occurs to a lesser extent in the presence 
of the candidate drug than in the absence of the candidate drug, the candidate 
drug is a drug that interferes with formation of a complex between C34 
peptide and N36 peptide. 

25 19. The method of Claim 1 7 wherein C34 peptide and N36 peptide are each 

labeled by a member of a pair or donor-acceptor molecules and the extent to 
which light emission occurs is assessed in the test sample and in the control 
sample, wherein if light emission is less in the test sample than in the control 
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sample, the candidate drug is a drug which inhibits formation of a complex 
between C34 prptide and N36 peptide. 



20. The method of Claim 16 further comprising assessing whether the drug that 
interferes with formation of the complex is an inhibitor of HIV entry into 

5 cells by assessing the effect of the drug on cell/cell fusion or HIV infection 

of cells is less in the presence of the drug than in its absence, the drug is an 
inhibitor of HIV entry into cells. 

21 . A method of eliciting an immune response in an individual, comprising 
introducing into the individual a peptide comprising a trimeric form of a 

10 coiled-coil region of a protein and a sufficient portion of the N-peptide 

region of HIV gp41 to comprise the amino acid residues which form part or 
all of the N-helix coiled-coil of HIV gp41 and the peptide is present in a 
pharmaceutically acceptable carrier. 

22. The method of Claim 21 wherein the peptide is introduced into the individual 
15 by a route of administration selected from the group consisting of: 

intramuscularly, intraperitoneally, orally, nasally and transdermally. 

23. The method of Claim 21 wherein the coiled-coil is selected from the group 
consisting of: GCN4-pI Q I; GCN4-pII; Moloney Murine Leukemia Virus and 
ABC heterotrimer. 



20 24. The method of Claim 21 wherein the peptide is IQN17. 

25. A method of interfering with entry of HIV into a mucosal cell comprising 
administering or applying to a mucosal surface a composition comprising: 
(1) a drug which binds HIV envelope protein gp4i subunit and interferes 
with entry of HIV into cells of the mucosal surface and (2) a carrier or base. 
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26. The method of Claim 25 wherein the drug binds the cavity on the surface of 
the N-helix coiled-coil of HTV envelope protein gp41 subunit. 

27. The method of Claim 26 wherein the drug prevents or reduces the gp41 
conformational change, thereby interfering with entry of HIV into cells of 

5 the mucosal surface. 

28. The method of Claim 25 wherein the composition comprises a component 
selected from the group consisting of: 

(a) C34 peptide; 

(b) DP178; 
10 (c) DP649; 

(d) T1249; 

(e) a derivative of (a) - (d); 

(f) a D-peptide which binds to the hydrophobic pocket of HTV gp4 1 ; 

(g) a derivative of (f); 

15 (h) a combination of two or more of (a)-(g); and 

(i) a molecule that inhibits HIV infectivity by binding to the N-helix 
coiled coil. 

29. The method of Claim 28 wherein the carrier or base is selected from the 
group consisting of: a foam, a gel, other substance sufficiently viscous to 

20 retain the drug, water and a buffer. 

30. The method of Claim 28 wherein the carrier or base is a vaginal suppository 
or rectal suppository. 



25 



31. 



The method of Claim 28 wherein the drug is released from the carrier or base 
immediately or soon after it is administered or applied to the vagina, mouth 
or rectum. 
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32. The method of Claim 28 wherein the drug is released from the carrier or base 

gradually or after a specified period after it is administered or applied 
to the vagina, mouth or rectum. 

33. The method of Claim 28 wherein the drug is on the surface of or 

5 incorporated within a contraceptive device in a manner which permits 

release of the drug under conditions of use. 

34. The method of Claim 28 wherein the D-peptide of (e) comprises an amino 
acid sequence selected from the group consisting of: 





(a) 


CDLKAKEWFWLC (SEQ ID NO: 3); 


10 


(b) 


CEARHREWAWLC (SEQ ID NO: 4); 




(c) 


CELLGWEWAWLC (SEQ ID NO: 5); 




(d) 


CLLRAPEWGWLC (SEQ ID NO: 6); 




(e) 


CSRSQPEWEWLC (SEQ ID NO: 7); 




(f) 


CGLGQEEWFWLC (SEQ ID NO: 8); 


15 


(g) 


CMRGEWEWSWLC (SEQ ID NO: 9); 




(h) 
(i) 


CPPLNKEWAWLC (SEQ ID NO: 10); 
CVLKAKEWFWLC (SEQ ID NO: 1 1); 




G) 


KKGACGLGQEEWFWLC (SEQ ID NO: 15); 




00 


KKGACELLGWEWAWLC (SEQ ID NO: 16); 


20 


w. 


KKKKGACELLGWEWAWLC (SEQ ID NO: 17); 




(m) 


KKGACMRGEWEWSWLC (SEQ ID NO: 18); 




(n) 


KKGACPPLNKEWAWLC (SEQ ID NO: 19); 




(o) 


a D-peptide comprising WXWL (SEQ ID NO: 23); 




(P) 


. a D-peptide comprising EWXWL (SEQ ID NO: 24); 


25 


(q) 


a D-peptide comprising CXXXXXEWXWL (SEQ ID NO: 12) 




(r) 


ac-GACEARHREWAWLCAA-am (SEQ ID NO: 34); 




(r) 


ac-KKGACEARHREWAWLCAA-am (SEQ ID NO: 38); 




(0 


ac-KKKKGACEARHREWAWLCAA-am (SEQ ID NO: 43); 




(u) 


ac-GACGLGQEEWFWLCAA-am (SEQ ID NO: 44); 
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(v) ac-KKGACGLGQEEWFWLCAA-am (SEQ ID NO: 15); 

(w) ac-KKKKGACGLGQEEWEWLCAA-am (SEQ ID NO: 45) 

(x) ac-GACDLKAKEWFWLCAA-am (SEQ ID NO: 35); 

(y) ac-KKGACDLKAKEWFWLCAA-am (SEQ ID NO: 39); 

(z) ac-KKKKGACDLKAKEWFWLCAA-am (SEQ ID NO: 46); 

(a') ac-GACELLGWEWAWLCC-am (SEQ ID NO: 47); 

(b') ac-KKGACELLGWEWAWLCAA-am (SEQ ID NO: 16); 

(c') ac-KKKKGACELLGWEWAWLCAA-am (SEQ ID NO: 1 7); 

(d') ac-GACSRSQPEWEWLCAA-am (SEQ ID NO: 36); 

(e') ac-KKGACSRSQPEWEWLCAA-am (SEQ ID NO: 40); 

(f ) ac-KKKKGACSRSQPEWEWLCAA-am (SEQ ID NO: 48); 

(g') ac-GACLLRAPEWGWLCAA-am (SEQ ID NO: 37); 

(h') ac-KKGACLLRAPEWGWLCAA-am (SEQ ID NO: 41); 

(i') ac-KKKKGACLLRAPEWGWLCAA-am (SEQ ID NO: 49); 

(j ') ac-GACMRGEWEWS WLC AA-am (SEQ ID NO: 50); 

(k') ac-KKGACMRGE WEWSWLCAA-am (SEQ ID NO: 1 8); 

(1') ac-KKKKGACMRGEWEWSWLCAA-am (SEQ ID NO: 51); 

(m') ac-GACPPLNKEWAWLCAA-am (SEQ ID NO: 52); 

(n') ac-KKGACPPLNKEWA WLC AA-am (SEQ ID NO: 19); 

(o') ac-KKKKGACPPLNKEWAWLCAA-am (SEQ ID NO: 53); 

(p0 ac-GACXXXXXEWXWLCAA-am (SEQ ID NO: 54); 

(q'j ac-KKGACXXXXXEWXWLCAA-am (SEQ ID NO: 55); 

(r') ac-KKKKGACXXXXXEWXWLCAA-am (SEQ ID NO: 56); 

(s') ac-XXCXXXXXEWXWLCXX-am (SEQ ID NO: 57); 

(f) ac-KKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 58); 

(uO ac-KKKKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 59); 

(v') ac-XXCXXXXXEWXWLCXXX-am (SEQ ID NO: 60); 

(wO ac-KKXXCXXXXXEWXWLCXXX-am (SEQ ID NO: 61); 

(x0 ac-KKKKXXCXXXXXEWXWLCXXX-am (SEQ ID NO: 62); and 

(yO a variant of a sequence of (a) through (x 0, wherein the variant binds 
the N-helix coiled-coil cavity of HIV gp41, wherein ac- at the C- 
terminus and -am at the N-terminus are optional. 
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A method of identifying a compound or molecule which binds the N-helix 
coiled-coil cavity of HIV- 1 gp41 envelope protein, wherein the compound or 
molecule to be assessed is referred to as a candidate inhibitor, comprising: 

(a) combining a D-peptide which binds the N-helix coiled-coil cavity, a 
fusion protein which is a soluble model which presents the N-helix 
coiled-coil cavity and a candidate inhibitor, under conditions 
appropriate for binding of the D-peptide to the N-helix coiled-coil 
cavity, thereby producing a test sample; 

(b) determining the extent to which binding of the D-peptide to the N- 
helix coiled-coil cavity in the test sample; and 

(c) comparing the extent of binding determined in to the N-helix coiled- 
coil cavity in a control sample, wherein the control sample is the 
same as the test sample except that the control sample does not 
include the candidate inhibitor and is maintained under the same 
conditions appropriate for binding of the D-peptide to the N-helix 
coiled-coil cavity as is the test sample, 

wherein if the extent of binding in the test sample is less than the extent of 
binding in the control sample, the candidate inhibitor is a compound or 
molecule which binds the N-helix coiled-coil cavity of HIV-1 gp41 envelope 
protein. 

The method of Claim 35 wherein the fusion protein is IQN17. 

The method of Claim 35 wherein the D-peptide is labeled with a fluorescent 
reporter and the fusion protein is labeled with a quencher which, when in 
sufficiently close proximity to the fluorescent reporter, quenches the signal 
from the reporter and detection of a signal from the fluorescent reporter 
indicates that the candidate inhibitor is a compound or molecule which binds 
the N-helix coiled-coil cavity of HIV-1 gp41 envelope protein. 

A fusion protein comprising a trimeric form of the coiled-coil region of 
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GCN4 and a portion of the N-peptide region of HIV-1 gp41, wherein the 
portion of the N-peptide region of gp4l comprises part or all or none of the 
N-helix coiled-coil pocket of HTV-1 gp41. 

39. A fusion protein of Claim 38 wherein the portion of the N-peptide region of 
5 HTV gp41 comprises the following 24 amino acid residues of HTV: 

SGrVQQQNNLLRAI EAQQHLLQLT. 

40. A method of eliciting an immune response in an individual, comprising 
introducing into the individual a fusion protein comprising a soluble, trimeric 
form of a coiled-coil and a sufficient portion of the N-peptide region of fflV- 

10 1 gp41, to comprise the amino acid residues which form the pocket of the N- 

helix coiled-coil of HIV-1 gp41, wherein the fusion protein is present in a 
pharmaceutically acceptable carrier. 

41. A D-peptide which comprises at least four amino acid residues and 
comprises the consensus sequence WXWL, wherein W represents D- 

1 5 tryptophan, L represents D-leucine and X represents any moiety. 

42. The D-peptide of laim 41 wherein X is a D-amino acid residue or a modified 
D-amino acid residue. 

43. The D-peptide of Claim 41 , wherein the D-peptide comprises 2 to 21 amino 
acid residues. 

20 44. A D-peptide which comprises at least five amino acid residues, wherein the 
at least five amino acid residues are EWXWL, wherein E represents D- 
glutamic acid, W represents D-tryptophan, L represents D-leucine and X 
represents an amino acid residue, a modified amino acid residue or a moiety 
other than an amino acid residue. 
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A D-peptide which comprises an amino acid sequence selected from the 
group consisting of: 

(a) CDLKAKEWFWLC (SEQ ID NO: 3); 

(b) CEARHREWAWLC (SEQ ID NO: 4); 

(c) CELLGWEWAWLC (SEQ ID NO: 5); 

(d) CLLRAPEWGWLC (SEQ ID NO: 6); 

(e) CSRSQPEWEWLC (SEQ ID NO: 7); 

(f) CGLGQEEWFWLC (SEQ ID NO: 8); 

(g) CMRGEWEWSWLC (SEQ ID NO: 9); 

(h) CPPLNKEWAWLC (SEQ ID NO: 10); 

(i) CVLKAKEWFWLC (SEQ ID NO: 1 1 ); 

(j) KKGACGLGQEEWFWLC (SEQ ID NO: 15); 

(k) KKGACELLGWEWAWLC (SEQ ED NO: 16); 

(1) KKKKGACELLGWE WA WLC (SEQ ID NO: 1 7); 

(m) KKGACMRGEWEWSWLC (SEQ ID NO: 18); 

(n) KKGACPPLNKEWAWLC (SEQ ID NO: 19); 

(o) a D-peptide comprising WXWL (SEQ ID NO: 23); 

(p) a D-peptide comprising EWXWL (SEQ ID NO: 24); 

(q) a D-peptide comprising CXXXXXEWXWL (SEQ ID NO: 12) 

(r) ac-GACEARHREWAWLCAA-am (SEQ ID NO: 34); 

(r) ac-KKGACEARHREWAWLCAA-am (SEQ ID NO: 38); 

(t) ac-KKKKGACEARHREWAWLCAA-am (SEQ ID NO: 43); 

(u) ac-GACGLGQEEWFWLCAA-am (SEQ ID NO: 44); 

(v) ac-KKGACGLGQEEWFWLCAA-am (SEQ ID NO: 1 5); 

(w) ac-KKKKGACGLGQEEWFWLCAA-am (SEQ ID NO: 45) 

(x) ac-GACDLKAKEWFWLCAA-am (SEQ ID NO: 35); 

(y) ac-KKGACDLKAKEWFWLCAA-am (SEQ ID NO: 39); 

(z) ac-ICKXKGACDLKAKEWFWLCAA-am (SEQ ID NO: 46); 

(a') ac-GACELLGWEWAWLCC-am (SEQ ID NO: 47); 

(b') ac-KKGACELLGWEWAWLCAA-am (SEQ ID NO: 16); 

(c') ac-KKKKGACELLGWEWAWLCAA-am (SEQ ID NO: 17); 



WO 00/06599 



PCT/US99/17351 



-81- 

(d') ac-GACSRSQPEWEWLCAA-am (SEQ ID NO: 36); 

(e') ac-KKGACSRSQPEWEWLCAA-am (SEQ ID NO: 40); 

(f ) ac-KKKKG ACSRSQP EWE WLC AA-am (SEQ ID NO: 48); 

(g') ac-GACLLRAPEWGWLCAA-am (SEQ ID NO: 37); 

5 (h') ac-KKGACLLRAPEWGWLCAA-am (SEQ ED NO: 41); 

(i') ac-KKKKGACLLRAPEWGWLCAA-am (SEQ ID NO: 49); 

0 ') ac-GACMRGEWEWS WLC AA-am (SEQ ID NO: 50); 

(k') ac-KKGACMRGEWEWSWLCAA-am (SEQ ID NO: 1 8); 

(T) ac-KKKKGACMRGEWEWSWLCAA-am (SEQ ID NO: 5 1); 

10 (m') ac-GACPPLNKEWAWLCAA-am (SEQ ID NO: 52); 

(n') ac-KKGACPPLNKEW AWLCAA-am (SEQ ID NO: 1 9); 

(o') ac-KKKKGACPPLNKEWAWLCAA-am (SEQ ID NO: 53); 

(p') ac-GACXXXXXEWXWLCAA-am (SEQ ID NO: 54); 

(q')* ac-KKGACXXXXXEWXWLCAA-am (SEQ ID NO: 55); 

15 (r ') ac-KKKKGACXXXXXEWXWLCAA-am (SEQ ID NO: 56); 

(s') ac-XXCXXXXXEWXWLCXX-am (SEQ ID NO: 57); 
(f) ac-KKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 58); 
(u') ac-KKKKXXCXXXXXEWXWLCXX-am (SEQ ID NO: 59); 
(v') ac-XXCXXXXXEWXWLCXXX-am (SEQ ID NO: 60); 
20 (W) ac-KKXXCXXXXXEWXWLCXXX-am (SEQ ID NO: 61); 

(x') ac-KKKKXXCXXXXXEWXWLCXXX-am (SEQ ID NO: 62); and 
(y') a variant of a sequence of (a) through (x'), wherein the variant binds 
the N-helix coiled-coil cavity of HIV gp4l, wherein ac- at the C- 
terminus and -am at the N-terminus are optional. 

25 46. A method of identifying a drug that binds the N-helix coiled-coil cavity of 
HIV gp41 comprising: 

(a) combining: ( 1 ) a candidate drug to be assessed for its ability to bind 
the N-helix coiled-coil cavity of HIVgp41 and; (2) a fusion protein 
which comprises a trimeric version of the coiled-coil region of a 
30 protein and a sufficient portion of the N-peptide of HTV gp41 to 
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include the HIV gp41 cavity, under conditions appropriate for 
presentation of the HIV gp41 cavity for binding by a drug; and 
(b) determining whether the candidate drug binds the HIV gp41 cavity, 
wherein if binding occurs, the candidate drug is a drug which binds 
5 the N-helix coiled-coil cavity of HTV gp4 1 . 

47. The method of Claim 46 wherein in (a), a peptide which binds the N-helix 
coiled-coil cavity of HIV gp41 is combined with the candidate drug and the 
fusion protein and in (b), whether the candidate drug binds the HIV gp41 
cavity is determined in the presence of the peptide which binds the N-helix 

1 0 coiled-coil cavity of HIV gp41 . 

48. The method of Claim 42 wherein the peptide which binds the N-helix 
coiled-coil cavity of HIV gp41 is selected from the group consisting of: 



(a) 


CDLKAKEWFWLC (SEQ ID NO: 3); 


(b) 


CEARHREWAWLC (SEQ ID NO: 4); 


(c) 


CELLGWEWAWLC (SEQ ID NO: 5); 


(d) 


CLLRAPEWGWLC (SEQ ID NO: 6); 


(e) 


CSRSQPEWEWLC (SEQ ID NO: 7); 


CO 


CGLGQEEWFWLC (SEQ ID NO: 8); 


(g) 


CMRGEWEWSWLC (SEQ ID NO: 9); 


(h)_ 


CPPLNKEWAWLC (SEQ ID NO: 10); 


(i)~ 


CVLKAKEWFWLC (SEQ ID NO: 1 1); 


G) 


KKGACGLGQEEWFWLC (SEQ ID NO: 15); 


GO 


KKGACELLGWEWAWLC (SEQ ID NO: 16); 


(0 


KKKKGACELLGWEWAWLC (SEQ ID NO: 17); 


(m) 


KKGACMRGEWEWSWLC (SEQ ID NO: 18); 


(n) 


KKGACPPLNKEWAWLC (SEQ ID NO: 19); 


(o) 


a D-peptide comprising WXWL (SEQ ID NO: 23); 


(P) 


a D-peptide comprising EWXWL (SEQ ID NO: 24); 


(q) 


a D-peptide comprising CXXXXXEWXWL (SEQ ID NO: 12) 
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(r) ac-GACEARHREWAWLCAA-am (SEQ ID NO: 34); 

(r) ac-KKGACEARHREWAWLCAA-am (SEQ ID NO: 38); 

(t) ac-KKKKGACEARHREWAWLCAA-am (SEQ ID NO: 43); 

(u) ac-GACGLGQEEWFWLCAA-am (SEQ ID NO: 44); 

5 (v) ac-KKGACGLGQEEWFWLCAA-am (SEQ ED NO: 15); 

(w) ac-KKKKGACGLGQEEWFWLCAA-am (SEQ ID NO: 45) 

(x) ac-GACDLKAKEWFWLCAA-am (SEQ ID NO: 35); 

(y) ac-KKGACDLKAKEWF WLCAA-am (SEQ ID NO: 39); 

(z) ac-KKKKGACDLKAKEWF WLCAA-am (SEQ ID NO: 46); 

[ 0 (a') ac-GACELLGWEWAWLCC-am (SEQ ED NO: 47); 

(b') ac-KKGACELLGWEWAWLCAA-am (SEQ ID NO: 16); 

(c') ac-KKKKGACELLGWEWAWLCAA-am (SEQ ID NO: 17); 

(d') ac-GACSRSQPEWEWLCAA-am (SEQ ED NO: 36); 

(e') ac-KKGACSRSQPEWEWLCAA-am (SEQ ID NO: 40); 

15 (f ) ac-KKKKGACSRSQPEWEWLC AA-am (SEQ ED NO: 48); 

(g') ac-GACLLRAPEWGWLCAA-ara (SEQ ED NO: 37); 

(h') ac-KKGACLLRAPEWGWLCAA-am (SEQ ED NO: 41); 

(i') ac-KKKKGACLLRAPEWGWLCAA-am (SEQ ED NO: 49); 

(j ') ac-GACMRGEWEWS WLCAA-am (SEQ ED NO: 50); 

20 (k') ac-KKGACMRGEWEWSWLCAA-am (SEQ ID NO: 1 8); 

(1') ac-KKKKGACMRGEWEWSWLC AA-am (SEQ ED NO: 5 1); 

(m') ac-GACPPLNKEW A WLCAA-am (SEQ ED NO: 52); 

(n~) ac-KKGACPPLNKEW A WLCAA-am (SEQ ID NO: 19); 

(o') ac-KKKKGACPPLNKEWA WLCAA-am (SEQ ID NO: 53); 
25 (p') ac-GACXXXXXEWXWLCAA-am (SEQ ID NO: 54); 

(q') ac-KKGACXXXXXEWXWLCAA-am (SEQ ED NO: 55); 

(r') ac-KKKKGACXXXXXEWXWLCAA-am (SEQ ID NO: 56); 

(s') ac-XXCXXXXXEWXWLCXX-am (SEQ ID NO: 57); 

(f) ac-KKXXCXXXXXEWXWLCXX-am (SEQ ED NO: 58); 
30 (u') ac-KKKKXXCXXXXXEWXWLCXX-am (SEQ ED NO: 59); 

(v') ac-XXCXXXXXEWXWLCXXX-am (SEQ ED NO: 60); 
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(w f ) ac-KKXXCXXXXXEWXWLCXXX-am (SEQ ID NO: 61); 
(x') ac-KKKKXXCXXXXXEWXWLCXXX-am (SEQ ID NO: 62); and 
(y ') a variant of a sequence of (a) through (x')> wherein the variant binds 
the N-helix coiled-coil cavity of HIV gp41, wherein ac- at the C- 
5 terminus and -am at the N-terminus are optional. 

49. The method of Claim 46 wherein the candidate drug is detectably labeled 
and binding of the candidate drug to the HIV gp41 cavity is determined by 
detecting the presence of the detectable label on the HIV gp41 cavity. 

50. The method of Claim 46 wherein the fusion protein comprises a soluble, 

10 trimeric version of the coiled-coil region of GCN4 and a sufficient portion of 

the N-peptide of HIV gp41 to include the HIV gp41 cavity. 

5 1 . The method of Claim 50 wherein the fusion protein is IQN17 or a variant 
thereof, wherein the amino acid sequence of IQN17 is SEQ ED NO.: 2. 

52. A method of identifying a drug that binds the N-helix coiled-coil cavity of 
1 5 HIV gp4 1 comprising: 

(a) combining: (1) a soluble model that presents the N-helix coiled-coil 
cavity of HIV gp 41 in such a manner that it is available for binding 
by a drug and (2) a candidate drug, which is to be assessed for its 
ability to bind the N-helix coiled-coil cavity; and 
20 (b) determining whether the candidate drug binds the N-helix coiled coil 

cavity of the soluble model, 

wherein if binding occurs, the candidate drug is a drug which binds the 

N-helix coiled-coil cavity of HIV gp41. 



53. 

25 



A method of producing a drug that binds the N-helix coiled-coil cavity of 

HIV gp41 and inhibits HIV entry into cells, comprising 

(a) combining (1) a candidate drug to be assessed for its ability to bind 
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the N-helix coiled-coil cavity of HIV gp41 and inhibit HIV entry into 
cells and (2) a fusion protein which comprises a trimeric version of 
the coiled-coil region of a protein and a sufficient portion of the 
N-peptide of HIV gp41 to include the HIV gp41 cavity, under 
5 conditions appropriate for presentation of the HIV gp41 cavity for 

binding by a drug; 

(b) determining whether the candidate drug binds the HIV gp41 cavity, 
wherein if binding of the candidate drug to the N-helix coiled-coil 
cavity of HIV gp41, occurs, the candidate drug is a drug which binds 

I o the N-helix coiled-coil cavity of HIV gp4 1 , whereby a drug which 

binds the N-helix coiled-coil cavity of HIV gp41 is produced; and 

(c) assessing the ability of the drug produced in (b) to inhibit HIV entry 
into cells, wherein if the drug inhibits HIV entry into cells, it is a drug 
which binds the N-helix coiled-coil cavity of HIV gp41 and inhibits 

15 HIV entry into cells. 

54. The method of Claim 53 wherein the fusion protein of (a)(2) comprises a 
soluble, trimeric coiled-coil region of GCN4 and a sufficient portion of the 
N-peptide of HIV gp41 to include the HIV gp41 cavity and the ability of the 
drug produced in (b) to inhibit HIV entry into cells is assessed in a 

20 syncytium assay, an infection assay or both. 

55 . The method of Claim 54 wherein the drug identified in (c) is further assessed 
for its ability to inhibit HIV entry into cells by in vivo assessment in an 
appropriate animal model. 

56. The method of Claim 54 wherein the fusion protein is IQN1 7 or a variant 
25 thereof, wherein the amino acid sequence of IQN1 7 is SEQ ID NO.:2. 



57. 



A method of producing a soluble model of the N-helix coiled-coil cavity of 
HIV gp41, comprising producing a fusion protein comprising: (a) a soluble, 



WO 00/06599 



PCTAJS99/17351 



-86- 



trimeric form of a coiled-coil and (b) a sufficient portion of the N-peptide 
region of HIV gp41 to comprise the amino acid residues which form the 
pocket of the N-helix coiled-coil of HIV gp41. 

58. The method of Claim 57 wherein the protein of (a) is GCN^pIgl, GCN4-pn, 
5 Moloney Murine Leukemia Virus or ABC heterotrimer and the sufficient 

portion of (b) is selected from the group consisting of a portion comprising 
SEQ ID NO: 20; a portion comprising SEQ ID NO: 26; a portion comprising 
SEQ ID NO: 27 and a portion comprising SEQ ID NO: 42. 

59. The method of Claim 57 wherein the fusion protein is IQN17 or a variant 
10 thereof, wherein the amino acid sequence of IQN1 7 is SEQ ID NO: 2. 

60. A method of producing a drug that binds the N-helix coiled-coil cavity of 
HIV gp41 comprising, 

(a) producing or obtaining a soluble model of the N-helix coiled-coil 
cavity of HIV gp41; 

15 (b) combining: (1) a candidate drug to be assessed for its ability to bind 

the N-helix coiled-coil cavity of HIV gp41 and (2) the soluble model 
of the N-helix coiled-coil cavity of HIV gp41; and 
(c) determining whether the candidate drug binds the N-helix coiled-coil 
cavity of HIV gp41, 

20 wherein if the candidate drug binds the N-helix coiled-coil cavity of HIV 

gp41, the candidate drug is a drug which binds the N-helix coiled-coil cavity 
of HIV gp41, whereby a drug which binds the N-helix coiled-coil cavity of 
HIV gp41 is produced. 

61 . The method of Claim 60 wherein the soluble model is a fusion protein which 
25 comprises a trimeric version of the coiled-coil region of a protein and a 

sufficient portion of the N-peptide of HIV gp41 to include the HIV gp41 
cavity. 
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62. The method of Claim 6 1 wherein the fusion protein is IQN1 7 or a variant 
thereof, wherein the amino acid sequence of IQN17 is SEQ ID NO.:2. 

63. A method of producing a drug that binds the N-helix coiled-coil cavity of 
HIV gp41 and inhibits its entry into cells, comprising; 

5 (a) producing or obtaining a soluble model of the N-helix coiled-coil 

cavity of HIV gp41; 
(b) combining: ( 1 ) a candidate drug to be assessed for its ability to bind 
the N-helix coiled-coil cavity of HTV gp41 and (2) the soluble model 
of the N-helix coiled-coil cavity of HIV gp41; 

! o ( C ) determining whether the candidate drug binds the N-helix coiled-coil 

cavity of HIV gp41, wherein if the candidate drug binds the N-helix 
coiled-coil cavity of HIV gp41, the candidate drug is a drug which 
binds the N-helix coiled-coil cavity of HTV gp41, whereby a drug 
which binds the N-helix coiled-coil cavity of HIV gp41 is produced 

15 and; 

(d) assessing the ability of the drug produced in (c) to inhibit HTV entry 

into cells, 

wherein if the drug inhibits HIV entry into cells, it is a drug which binds the 
N-helix coiled-coil cavity of HTV gp41 and inhibits HIV entry into cells. 

20 64. The method of Claim 63 wherein the soluble model is a fusion protein which 
comprises a trimeric version of the coiled-coil region of a protein and a 
sufficient portion of the N-peptide of HTV gp41 to include the HIV gp41 
cavity. 

65. The method of Claim 64 wherein the fusion protein is IQN17 or a variant 
25 thereof, wherein the amino acid sequence of IQN1 7 is SEQ ID No.;2. 



66. 



A drug produced by the method of Claim 60. 
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67. A drug produced by the method of Claim 6 1 . 

68. A drug produced by the method of Claim 62. 

69. A drug produced by the method of Claim 63. 

70. A drug produced by the method of Claim 64. 

5 71 . A drug produced by the method of Claim 65. 

72. A method of identifying a peptide that binds to the N-helix coiled-coil cavity 
of HIV gp41, comprising: 

(a) combining IQN1 7 in the D-handedness with a phage display library 
of L- amino acid peptides, under conditions appropriate for binding of 

1 0 members of the library to IQN1 7 in the D-handedness; and 

(b) determining if binding occurs between IQN1 7 in the D-handedness 
and a member or members of the phage display library, wherein if 
binding occurs, a peptide that binds to the N-helix coiled-coil cavity 
of HIV gp41 in the D-handedness is identified. 

15 73. The method of Claim 72 further comprising determining the amino acid 

sequence of the member or members of the phage display library which bind 
to IQN1 7 in the D-handedness and producing peptides, in D form, 
comprising the amino acid sequences determined, wherein the peptides in D 
form bind the N-helix coiled-coil cavity in the natural L-handedness. 



20 74. 



A compound of Formula I, 
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wherein 

A is a D- amino acid residue or an N-substituted glycyl residue of the 
formula 



Rm W 

I I 
-N C C 



R A2 



wherein one of R A1 and is a substituted or unsubstituted aryl, 
heteroaryl, arylmethyl, heteroarylmethyl, benzo-fused aryl, benzo- 
fused heteroaryl, benzo-fused arylmethyl, benzo-fused 
heteroarylmethyl, cycloalkyl or bicycloalkyl; and the other is 
!0 hydrogen; and W is hydrogen, methyl, trifluoromethyl or halogen, for 

example, fluorine, chlorine, bromine or iodine; 

B is a glycyl residue or a D-amino acid or N-substituted glycyl residue of the 
formula 



Rbi X ° 



I 

-N C C 



Rb2 
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wherein one of R B1 and is a substituted or unsubstituted linear, 
branched or cyclic alkyl, aryl, arylalkyl, heteroaryl or heteroarylalkyl 
group; and the other is hydrogen; and X is hydrogen, methyl, 
trifluoromethyl or halogen, such as fluorine, chlorine, bromine or 
5 iodine; 

D is a D- amino acid residue or N-substituted glycyl residue of the formula 




wherein one of R DI and R D2 is a substituted or unsubstituted aryl, 
heteroaryl, arylmethyl, heteroarylmethyl, benzo-fused aryl, benzo- 
10 fused heteroaryl, benzo-fused arylmethyl; benzo-fused 

heteroarylmethyl, cycloalkyl or bicycloalkyl; and the other is 
hydrogen; andY is hydrogen, methyl, trifluoromethyl or halogen, 
such as fluorine, chlorine, bromine or iodine; 

E is a D-amino acid residue or N-substituted glycyl residue of the formula 




wherein one of R E , and R E2 is a substituted or unsubstituted, linear, 
branched or cyclic alkyl, aryl or arylalkyl group; and the other is 
hydrogen; and Z is hydrogen, methyl, trifluoromethyl or halogen, 
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such as fluorine, chlorine, bromine or iodine; 

K, L, M and N are each, independently, an amino acid residue or a 
polypeptide group comprising 2 to about 8 amino acid residues; 
F is a direct bond or a (Afunctional linking group; and 
5 n, p, q, r and s are each, independently, 0 or 1. 

75. The compound of Claim 74 wherein one of R AI and and one of R DI and 
Rb 2 are, independently, a phenyl, substituted phenyl, naphthyl, substituted 
naphthyl, naphthylmethyl, substituted naphthylmethyl, benzyl or substituted 
benzyl group, or a group of the formula 




where J is 0, S or NR, where R is H or linear, branched or cyclic C r 
C 6 -alkyl; and 

R„ R 2 , R 3 , R 4 and Rj are independently selected from the group 
15 consisting of 

hydrogen, halogen and alkyl. 

76. The compound of Claim 75 wherein R Al and R D! are both hydrogen. 
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77. The compound of Claim 74 wherein one of R B1 and R^ is hydrogen, 
substituted or unsubstituted linear, branched or cyclic C,-C 4 -alkyl, phenyl, 
benzyl, naphthyl or naphthylmethyl. 

78. The compound of Claim 77 wherein R BI is hydrogen. 

5 79. The compound of Claim 74 wherein one of R E1 and R K is a substituted or 

unsubstituted, linear, branched or cyclic C r C 6 -alkyl group or a substituted or 
unsubstituted phenyl or naphthyl group and the other is hydrogen. 

80. The compound of Claim 79 wherein R EI is hydrogen. 

8 1 . The compound of Claim 74 wherein A and D are each a D-tryptophan 
10 residue and E is a D-leucine residue. 

82. The compound of Claim 74 wherein K is a D-amino acid residue or an N- 
substituted glycyl residue comprising an amino-, carboxyl- or sulfhydryl 
substituted side chain and L is a polypeptide comprising 2 or 3 D-amino acid 
residues or N-substituted glycine residues. 

15 83. The compound of Claim 74 wherein M is a polypeptide group comprising 
from 2 to about 8 D-amino acid residues, of which at least one comprises an 
amino-, carboxy- or sulfhydryl substituted side chain, and N is a polypeptide 
group comprising from 1 to about 6 amino acid residues, of which at least 
one is a lysine residue. 

20 84. The compound of Claim 74 wherein F is a divalent linking group having a 
length from about 2 to about 40 atoms. 



85. 



The compound of Claim 84 wherein F is a polypeptide linking group of the 
formula -P n -, wherein n is an integer from 1 to about 12 and each P is 
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independently an L- or D- amino acid or N-substituted glycyl residue, a 
glycyl residue or an N-substituted glycyl residue. 

86. The compound of Claim 84 wherein F is a substituted or unsubstituted C 4 - 
CValkylene group or a Q-C^-alkylene group which is interrupted at one or 

5 more points by a heteroatom, a phenylene group or a heteroarylene group. 

87. The compound of Claim 84 wherein F is a polysaccharide group comprising 
from 1 to about 10 glycoside groups. 

88. A method of producing a drug which fits the N-helix coiled-coil pocket of 
HIV gp41 , comprising: 

] o ( a ) obtaining a crystal of a soluble, trimeric peptide model of the HTV 

gp41 hydrophobic pocket; 

(b) obtaining the atomic coordinates of the peptide model by X-ray 
diffraction studies using the crystal obtained in (a); 

(c) using the atomic coordinates obtained in (b) to define the N-helix 
1 5 coiled-coil pocket of HIV gp4 1 ; 

(d) identifying a molecule or compound which fits the N-helix coiled- 
coil pocket of HIV gp41; 

(e) obtaining the molecule or compound identified in (d); and 

(f) contacting the molecule or compound obtained in (e) with the N-helix 
20 coiled-coil pocket of HIV gp41 to assess the ability of the molecule 

or compound to fit the pocket of HIV gp41 , 
wherein if the molecule or compound fits the N-helix coiled-coil pocket of 
HTV gp4l, the molecule or compound is a drug which fits the pocket, 
whereby a drug which fits the N-helix coiled-coil pocket of HIV gp41 is 
25 produced: 

89. The method of Claim 88 wherein the soluble, trimeric peptide molecule 

comprises a soluble, trimeric form of a coiled coil and a sufficient portion of 
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the N-peptide region of HIV gp41 to comprise the amino acid residues which 
form the pocket of the N-helix coiled-coil of HIV gp41. 

90. The method of Claim 89 wherein in (f), the molecule or compound is 
contacted with the N-helix coiled-coil pocket of HIV gp41 by contacting the 

5 molecule or compound with IQN17, the N-helix of HIV gp41 or a 

polypeptide which comprises the HIV pocket. 

91 . The method of Claim 89 wherein the soluble model is IQN17. 

92. The method of Claim 88 wherein the crystal obtained in (a) is a crystal of 
IQN17 of space group C222. 

10 93. A method of producing a drug which binds the N-helix coiled-coil pocket of 
HIV gp41, comprising: 

(a) obtaining the atomic coordinates of IQN 1 7 ; 

(b) using the atomic coordinates obtained in (a) to define the N-helix 
coiled-coil pocket of HIV gp41 ; 

1 5 ( C ) identifying a molecule or compound which fits the N-helix coiled- 

coil pocket of HIV gp41; 

(d) obtaining the molecule or compound identified in (c); and 

(e) contacting the molecule or compound obtained in (d) with the N- 
helix coiled-coil pocket of HIV gp41 to assess the ability of the 

20 molecule or compound to fit the pocket of HIV gp41 , 

wherein if the molecule or compound fits the N-helix coiled-coil pocket of 
HIV gp4l, the molecule or compound is a drug which fits the pocket, 
whereby a drug which fits the N-helix coiled-coil pocket of HIV gp41 is 
produced. 



25 94. 



The method of Claim 93 wherein the atomic coordinates are the atomic 
coordinates in the PDB file represented in Figures 1 1 A- 11 V. 
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95. A method of identifying a molecule that binds to the N-helix coiled-coil 
cavity of HIV gp41, comprising: 

(a) combining IQN1 7 in the D-handedness with a biologically encoded 
library of ligands, under conditions appropriate for binding of 

5 members of the library to IQN1 7 in the D-handedness; and 

(b) determining if binding occurs between IQN1 7 in the D-handedness 
and a member or members of the biologically encoded library, 
wherein if binding occurs, a ligand that binds to the N-helix coiled- 
coil cavity of HIV gp41 in the D-handedness is identified. 

1 0 96. The method of Claim 95 further comprising determining the sequence of the 
member or members of the biologically encoded library which bind to 
IQN17 in the D-handedness, and producing ligands, in the mirror-image 
handedness of the biologically encoded ligands, comprising the sequences 
determined. 



15 97. 



The method of Claim 95 wherein the biologically encoded library is selected 
from the group consisting of a phage display library, a DNA library, an RNA 
library and a biologically encoded peptide library. 
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Figure 1 



SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARIL WMEWDREJNNYTSLIHSUEESQNQQEKNEQELL 

N36 ^ \ C34 




heptad repeat 1 



heptad repeat 2 



tm ( Intra) 
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Figure 3: D-peptide Sequences 



DlOpepl : Ac- GACEARHREWAWLCAA- CONH2 

DIOpopla: Ac - KK GACEARHREWAWLCAA - CONH2 

D10pep3 : Ac - KK GACGLGQEEWFWLCAA- CONH2 

D10pep4 : Ac - GACDLKAKEWFWLCAA - CONH2 

D10pop5 : Ac - KK GACELLGWEWAWLCAA- CONH2 

DIOpepSa: Ac - KKKK GACELLGWEWAWLCAA- CONH2 

D10pep6 : Ac -GACSRSQPEWEWLCAA- CONH2 

D10pep6a : Ac - KK GACSRSQPEWEWLCAA- CONH2 

D10pep7a: Ac - KK GACLLRAP EWGWLCAA - CONH2 

DlOpeplO: Ac - KK GACMRGEWEWSWLCAA- CONH2 

Dl0pepX2: Ac -KKGACPPLNKEWAWLCAA- CONH2 

Consensus Sequence CXXXXXEWXWLC 



Where: 

G = glycine 

A = alanine 

C = cysteine 

D = aspartic acid 

L = leucine 

K » lysine 

E s glutamic acid 

W = tryptophan 

F = phenylalanine 

R = arginine 

H = histidine 

S = serine 

Q = glut amine 
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Figure 4. 



1. Perform rounds of phage selection to identify binders to D-IQN17. 



D-IQN17 



Phage Library: 

c/sxxxxxxxxxxc/s 



•linker with trypsin site 
■biotin 



2. Sequence individual phage clones 

3. Test for specificity of binding. Determine if the phage bind to the gp41 
region of D-IQN17. 

D-IQN17 

D-JQN17 blank D-GCN4-pl Q i (G36W ) 



iTiYiM. 



4. Synthesize D-peptides. 

5. Assay anti-HIV activity of D-peptides. 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06599 



PCT/US99/17351 



5/45 



Relationship of D-peptides to IQN17 



Figure 5A 



Figure 5B 
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Figure 6B 



D-Peptide Approximate IC 50 Value 

(from one or more experiments) 

DlOpepI 2x1(T 5 M 

D10pep1A 3x1<T 5 M 

Dl0pep3 1 x1(T 5 M 

D10pep4 3x10' 5 M 

DlOpepS 3x1(T 6 M 

DlOpepSa . 6x1CT 6 M 

D10pep6 3x10" 5 M 

Dl0pep7a 4x1CT 5 M 

DpepIO 6x1(T 5 M 

Dpep12 2x1(T 4 M 



D1 0pep3 show anti-viral effects 
D10pep4 \ with IC 50 values of 
DlOpepS J less than 1 x 10" 4 M. 
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REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK • 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK* 

REMARK 

REMARK 



REFINEMENT. 
PROGRAM 
AUTHORS 



CNS 0.5 

SRUNGER , ADAMS, CLORE. DELANO, 
GROS. GROSSE - KUNSTLEVE , JIANG, 
KUSZEWSKI , NILGES, PANNU, READ, 
RICE, SIMONSCN, WARREN 



DATA USED IN REFINEMENT. 
RESOLUTION RANGE HIGH (ANGSTROMS) 
RESOLUTION RANGE LOW (ANGSTROMS) 
DATA CUTOFF (SIGMA(F)) 
DATA CUTOFF HIGH (ABS(F)) 
DATA CUTOFF LOW (ABS(F) ) 

COMPLETENESS (WORKING+TEST) (%) 
NUMBER OF REFLECTIONS 

FIT TO DATA USED IN REFINEMENT. 
CROSS-VALIDATION METHOD 
FREE R VALUE TEST SET SELECTION 
R VALUE (WORKING SET) 

FREE R VALUE 

FREE R VALUE TEST SET SIZE (%) 
FREE R VALUE TEST SET COUNT 
ESTIMATED ERROR OF FREE R VALUE 



1.50 
10.00 
0.0 

646169.44 
0.000000 
94.6 
13549 



THROUGHOUT 

RANDOM 

0.214 

0.245 

10.1 

1362 
0.007 



FIT IN THE HIGHEST RESOLUTION 3IN. 
TOTAL NUMBER OF BINS USED 
BIN RESOLUTION RANGE HIGH (A) 
BIN RESOLUTION RANGE LOW (A) 
BIN COMPLETENESS (WORKING+TEST) (%) 
REFLECTIONS IN BIN (WORKING SET) 
BIN R VALUE (WORKING SET) 

BIN FREE R VALUE 

BIN FREE R VALUE TEST SET SIZE (%) 
BIN FREE R VALUE TEST SET COUNT 
ESTIMATED ERROR OF BIN FREE R VALUE 



6 

1.50 
1.59 
96.1 

2008 
0.233 
0.270 

9.8 
219 
0.018 



NUMBER OF NON-HYDROGEN ATOMS USED IN REFINEMENT. 



PROTEIN ATOMS 
NUCLEIC ACID ATOMS ■ 
HETEROGEN ATGMS 
SOLVENT ATOMS 



B VALUES. 
FROM WILSON PLOT (A'*2> 
MEAN 3 VALUE (OVERALL, A**2) 

OVERALL ANISOTROPIC 3 VALUE. 



21.6 
29.7 



Bll (A**2) 
322 
333 
312 
312 
323 



(A**2) 
lA**2) 
(A* "2 J 
(A**2) 
iA'*2) 



BULK SOLVENT 
METHOD USED 
KSCL 



3.61 
3.61 
-7.22 
1.74 
0.00 
0.00 

MODELING . 
FLAT MODEL 
0.394054 



Figure 7A 
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REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK- 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 
SEQRES 

cryst: 

ORIGXl 



3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
1 
2 
3 
4 
5 
6 

8 
9 
10 
11 
12 
13 
14 
15 
16 
17 



BSOL 



58.3445 (A"2) 



ESTIMATED COORDINATE ERROR. 

ESD FROM LUZZATI PLOT (A) : 0.18 

ESD FROM SIGMAA (A) : 0.0S 

LOW RESOLUTION CUTOFF (A) : 5.00 

CROSS- VALIDATED ESTIMATED COORDINATE ERROR. 



ESD FROM C-V LUZZATI PLOT 
ESD FROM C-V SIGMAA 



(A) 
(A) 



0.20 
0.12 



RMS DEVIATIONS FROM IDEAL VALUES. 

BOND LENGTHS (A) : 0.012 

BOND ANGLES (DEGREES) : 1.5 

DIHEDRAL ANGLES (DEGREES) : 15.7. 

IMPROPER ANGLES (DEGREES) : 1.00 

ISOTROPIC THERMAL MODEL : RESTRAINED 

ISOTROPIC THERMAL FACTOR RESTRAINTS. 



MAIN-CHAIN BOND 
MAIN-CHAIN ANGLE 
SIDE-CHAIN BOND 
SIDE-CHAIN ANGLE 

NCS MODEL : NONE 

NCS RESTRAINTS. 
GROUP 1 POSITIONAL 
GROUP 1 B-FACTOR 



(A* 
(A* 
(A* 
(A 4 



*2) 
*2> 
*2) 
*2) 



(A) 
(A**2) 



RMS 
0.956 
1.503 
1.853 
2.676 



RMS 
NULL 
NULL 



SIGMA 

2.0 

3.0 

3.0 

3.5 



SIGMA/WEIGHT 
; NULL 
; NULL 



PARAMETER FILE 
PARAMETER FILE 
PARAMETER FILE 
TOPOLOGY FILE 
TOPOLOGY FILE 
TOPOLOGY FILE 



1 
2 
3 

i 

2 
3 



protein_rep_d . param 
CNS_TOPPAR/water_rep .param 
CNSJTOPPAR/ ion . param 
CNSJTOPPAR/ protein, cop 
CNSJTOPPAR/water . top 
CNSJTOPPAR/ ion . top 



OTHER REFINEMENT REMARKS: NULL 

A 214 ACE ARG MET LYS GLN ILE GLU ASP LYS ILE GLU GLU ILE 

A 214 GLU SER LYS GLN LYS LYS ILE GLU ASN GLU ILE ALA ARG 

A 214 ILE LYS LYS LEU LEU GLN LEU THR VAL TRP GLY ILE LYS 

A 214 GLN LEU GLN ALA ARG ILE LEU ACE DLY DLA DCS DLU DLA 

A 214 DRG DIS DRG DLU DRP DLA DRP DEU DCS DLA DLA CL WAT 

A 214 WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT 

A 214 WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT 

A 214 WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT 

A 214 WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT 

A 214 WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT 

A 214 WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT 

A 214 WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT WAT 
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Figure 10: Conformation of D10pep1 in complex with IQN17 
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